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1. 


INTRODUCTION 


The  Summer  Research  Program  (SRP),  sponsored  by  the  Air  Force  Office  of  Scientific  Research 
(AFOSR),  offers  paid  opportunities  for  university  faculty,  graduate  students,  and  high  school  students 
to  conduct  research  in  U.S.  Air  Force  research  laboratories  nationwide  during  the  summer. 

Introduced  by  AFOSR  in  1978,  this  innovative  program  is  based  on  the  coiKept  of  teaming  academic 
researchers  with  Air  Force  scientists  in  the  same  disciplines  using  laboratory  facilities  and  equipment 
not  often  available  at  associates'  institutions. 

The  Summer  Faculty  Research  Program  (SFRP)  is  open  annually  to  approximately  150  faculty 
members  with  at  least  two  years  of  teaching  and/or  research  experience  in  accredited  U.S.  colleges, 
universities,  or  technical  institutions.  SFRP  associates  must  be  either  U.S.  citizens  or  permanent 
residents. 

The  Graduate  Student  Research  Program  (GSRP)  is  open  annually  to  ^roximately  100  graduate 
students  holding  a  bachelor's  or  a  master's  degree;  GSRP  associates  must  be  U.S.  citizens  enrolled  full 
time  at  an  accredited  institution. 

The  High  School  Apprentice  Program  (HSAP)  annually  selects  about  125  high  school  students  located 
within  a  twenty  mile  commuting  distance  of  {larticipating  Air  Force  laboratories. 

AFOSR  also  offers  its  research  associates  an  qpportunity,  under  the  Summer  Research  Extension 
Program  (SREP),  to  continue  their  AFOSR-sponsored  research  at  their  home  institutions  through  the 
award  of  research  grants.  In  1994  the  maximum  amount  of  each  grant  was  increased  from  $20,000  to 
$25,000,  and  the  number  of  AFOSR-sponsored  grants  decreased  from  75  to  60.  A  separate  annual 
report  is  compiled  on  the  SREP. 

The  numbers  of  projected  summer  research  participants  in  each  of  the  three  categories  and  SREP 
“grants”  are  usually  increased  through  direct  sponsorship  by  participating  laboratories. 

AFOSR's  SRP  has  well  served  its  objectives  of  building  critical  links  between  Air  Force  research 
laboratories  and  the  academic  community,  caning  avenues  of  communications  and  forging  new 
research  relationships  between  Air  Force  and  academic  technical  experts  in  areas  of  national  interest, 
and  strengthening  the  nation's  efforts  to  sustain  careers  in  science  and  engineering.  The  success  of  the 
SRP  can  be  gauged  from  its  growth  from  inception  (see  Table  1)  and  from  the  favorable  responses  the 
1997  participants  expressed  in  end-of-tour  SRP  evaluations  (Appendix  B). 

AFOSR  contracts  for  administration  of  the  SRP  by  civilian  contractors.  The  contract  was  first 
awarded  to  Research  &  Develqiment  Laboratories  (RDL)  in  Sqitember  1990.  After  completion  of  the 
1990  contract,  RDL  (in  1993)  won  the  recompetition  for  the  basic  year  and  four  1-year  options. 
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2. 


PARTICIPATION  IN  THE  SUMMER  RESEARCH  PROGRAM 


The  SRP  began  with  faculty  associates  in  1979;  graduate  students  were  added  in  1982  and  high  school 
students  in  1986.  The  following  table  shows  the  number  of  associates  in  the  program  each  year. 


YEAR 

SRP  Participation,  by  Year 

TOTAL 

SFRP 

GSRP 

HSAP 

1979 

70 

70 

1980 

87 

87 

1981 

87 

87 

1982 

91 

17 

108 

1983 

101 

53 

154  1 

1984 

152 

84 

236  1 

1985 

154 

92 

246 

1986 

158 

100 

42 

300 

1987 

159 

101 

73 

333 

1988 

153 

107 

101 

361 

1989 

168 

102 

103 

373 

1990 

165 

121 

132 

418 

1991 

170 

142 

132 

444  I 

1992 

185 

121 

159 

464 

1993 

187 

117 

136 

440 

1994 

192 

117 

133 

442 

1995 

190 

115 

137 

442  1 

1996 

188 

109 

138 

435 

1997 

148 

98 

140 

427  1 

1998 

85 

40 

88 

213  1 

Beginning  in  1993,  due  to  budget  cuts,  some  of  the  laboratories  weren’t  able  to  afford  to  fund  as  many 
associates  as  in  previous  years.  Since  then,  the  number  of  funded  positions  has  remained  fairly 
constant  at  a  slightly  lower  level. 


3. 


RECRUITING  AND  SEUECTION 


The  SRP  is  conducted  on  a  nationally  advertised  and  competitive-selection  basis.  The  advertising  fw 
faculty  and  graduate  students  consisted  primarily  of  the  mailing  of  8,000  52-page  SRP  brochures  to 
chairpersons  of  dq)artments  relevant  to  AFOSR  research  and  to  administrators  of  grants  in  accredited 
universities,  colleges,  and  technical  institutions.  Historically  Black  Colleges  and  Universities 
(HBCUs)  and  MincHity  Institutions  (Mis)  were  included.  Brochures  also  went  to  all  participating 
USAF  laboratories,  the  previous  year's  participants,  and  numerous  individual  requesters  (over  ICXX) 
annually). 

RDL  placed  advertisements  in  the  following  publications:  Black  Issues  in  Higher  Education,  Winds  of 
Change,  and  IEEE  Spectrum.  Because  no  participants  list  either  Physics  Today  or  Chemical  &. 
Engineering  News  as  being  their  source  of  learning  about  the  program  for  the  past  several  years, 
advertisements  in  these  magazines  were  dropped,  and  the  funds  were  used  to  cover  increases  in 
brochure  printing  costs. 

High  school  ^plicants  can  participate  only  in  laboratories  located  no  more  than  20  miles  from  their 
residence.  Tailored  brochures  on  the  HSAP  were  sent  to  the  head  counselors  of  180  high  schools  in 
the  vicinity  of  particq)ating  laboratories,  with  instructions  for  publicizing  the  program  in  their  schools. 
High  school  students  selected  to  serve  at  Wright  Laboratory's  Armament  Directorate  (Eglin  Air  Force 
Base,  Florida)  serve  eleven  weeks  as  opposed  to  the  eight  weeks  normally  worked  by  high  school 
students  at  all  other  partic^ating  laboratories. 

Each  SFRP  or  GSRP  qjplicant  is  given  a  first,  second,  and  third  choice  of  laboratory.  High  school 
students  who  have  more  than  oik  laboratory  or  directorate  near  their  homes  are  also  given  first, 
second,  and  third  choices. 

Laboratories  make  their  selections  and  prioritize  their  nominees.  AFOSR  then  determines  the  number 
to  be  funded  at  each  laboratory  and  approves  laboratories'  selections. 

Subsequently,  laboratories  use  their  own  funds  to  sponsor  additional  candidates.  Some  selectees  do 
not  accept  the  appointment,  so  alternate  candidates  are  chosen.  This  multi-step  selection  procedure 
results  in  some  candidates  being  notified  of  their  acceptance  after  scheduled  deadlines.  The  total 
applicants  and  participants  for  1998  are  shown  in  this  table. 
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1  1998  Applicants  and  Participants 

PARTICIPANT 

TOTAL 

SLECTEES 

DECLINING 

CATEGORY 

AH»UCANTS 

SEIZCTEES 

SFRP 

382 

85 

13 

(HBCU/MI) 

(0) 

(0) 

(0) 

GSRP 

130 

40 

7 

(HBCU/MI) 

(0) 

(0) 

(0) 

HSAP 

328 

88 

22 

TOTAL 

840 

213 

42 

4.  SITE  VISITS 

During  June  and  July  of  1998,  representatives  of  both  AFOSR/NI  and  RDL  visited  each  participating 
laboratory  to  provide  briefings,  answer  questions,  and  resolve  problems  for  both  laboratory  personnel 
and  participants.  The  objective  was  to  ensure  that  the  SRP  would  be  as  constructive  as  possible  for  all 
participants.  Both  SRP  participants  and  RDL  representatives  found  these  visits  beneficial.  At  many  of 
the  laboratories,  this  was  the  only  c^portunity  for  all  participants  to  meet  at  one  time  to  share  their 
experiences  and  exchange  ideas. 


5.  HISTORICALLY  BLACK  COLLEGES  AND  UMVERSITIES  AND  MINORITY 
INSTITUTIONS  (HBCU/MIs) 

Before  1993,  an  RDL  program  representative  visited  fiom  seven  to  ten  different  HBCU/MIs  annually 
to  promote  intwest  in  the  SRP  among  the  faculty  and  graduate  smdents.  These  efforts  were  marginally 
effective,  yielding  a  doubling  of  HBCI/MI  t^plicants.  In  an  effort  to  achieve  AFOSR’s  goal  of  10% 
of  all  applicants  and  selectees  being  HBCU/MI  qualified,  the  RDL  team  decided  to  try  other  avenues 
of  approach  to  increase  the  number  of  qualified  applicants.  Through  the  combined  efforts  of  the 
AFOSR  Program  Office  at  Bolling  AFB  and  RDL,  two  very  active  minority  groiq)s  were  found, 
HACU  (Hispanic  American  Colleges  and  Universities)  and  AISES  (American  Indian  Science  and 
Engineering  Society).  RDL  is  in  communication  with  representatives  of  each  of  these  organizations  on 
a  monthly  basis  to  keep  up  with  the  their  activities  and  special  events.  Both  organizations  have 
widely-distributed  magazines/quarterlies  in  which  RDL  placed  ads. 

Since  1994  the  number  of  both  SFRP  and  GSRP  HBCU/MI  applicants  and  participants  has  increased 
ten-fold,  from  about  two  dozen  SFRP  applicants  and  a  half  dozen  selectees  to  over  100  applicants  and 
two  dozen  selectees,  and  a  half-dozen  GSRP  applicants  and  two  or  three  selectees  to  18  applicants  and 
7  or  8  selectees.  Since  1993,  the  SFRP  had  a  two-fold  applicant  increase  and  a  two-fold  selectee 
increase.  Since  1993,  the  GSRP  had  a  three-fold  tq^lkant  increase  and  a  three  to  four-fold  increase  in 
selectees. 
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In  addition  to  RDL's  special  recruiting  efforts,  AFOSR  attempts  each  year  to  obtain  additional  funding 
or  use  leftover  fundii^  from  cancellations  the  past  year  to  fund  HBCU/MI  associates. 


SRP  HBCU/MI  Participation,  By  Year 

YEAR 

SFRP 

GSRP 

Applicants 

Participants 

Applicants 

Participants 

1985 

76 

23 

15 

11 

1986 

70 

18 

20 

10 

1987 

82 

32 

32 

10 

1988 

53 

17 

23 

14 

1989 

39 

15 

13 

4 

1990 

43 

14 

17 

3 

1991 

42 

13 

8 

5 

1992 

70 

13 

9 

5 

1993 

60 

13 

6 

2 

1994 

90 

16 

11 

6 

1995 

90 

21 

20 

8 

1996 

119 

27 

18 

7 

6.  SRP  FUNDING  SOURCES 

Funding  sources  for  the  1998  SRP  were  the  AFOSR-provided  slots  for  the  basic  contract  and 
laboratory  funds.  Funding  sources  by  category  for  the  1998  SRP  selected  participants  are  shown  here. 
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I  1998  SRP  FUNDING  CATEGORY 

SFRP 

GSRP 

HSAP 

j  AFOSR  Basic  Allocation  Funds 

67 

38 

75 

1  USAF  Laboratory  Funds 

17 

2 

13 

1  Slots  Added  by  AFOSR 
(Leftover  Funds) 

0 

0 

0 

HBCU/MI  By  AFOSR 
(Using  Procured  AddnT  Funds) 

0 

0 

N/A 

1  TOTAL 

84 

40 

88 

7.  COMPENSATION  FOR  PARTICIPANTS 


Compensation  for  SRP  participants,  per  fhe-day  work  week,  is  shown  in  this  table. 


1998  SRP  Associate  Compensation 


PARTICIPANT  CATEGORY 

1991 

1992 

1993 

1994 

1995 

19% 

1997 

1998 

Faculty  Members 

$690 

$718 

$740 

$740 

$740 

$770 

$770 

$793 

Graduate  Student 
(Master's  Degree) 

$425 

$442 

$455 

$455 

$455 

$470 

$470 

$484 

Graduate  Student 
(Bachelor's  Degree) 

$365 

$380 

$391 

$391 

$391 

$400 

$400 

$412 

High  School  Student 
(First  Year) 

$200 

$200 

$200 

$200 

$200 

- 1 

$200 

$200 

$200 

High  School  Student 
(Subsequent  Years) 

$240 

$240 

$240 

$240 

$240 

$240 

$240  1  $240 

1 

i 

{ 

The  program  also  offered  associates  whose  homes  were  more  than  50  miles  from  the  labwatory  an 
expense  allowance  (seven  days  per  week)  of  $52/day  for  faculty  and  $41/day  for  graduate  students. 
Transportation  to  the  laboratory  at  the  beginning  of  their  tour  and  back  to  their  home  destinations  at 
die  end  was  also  reimbursed  for  these  participants.  Of  the  combined  SFRP  and  GSRP  associates, 

65  %  claimed  travel  reimbursements  at  an  average  round-trip  cost  of  $730. 

Faculty  members  were  encouraged  to  visit  their  laboratories  before  their  summer  tour  began.  All  costs 
of  these  orientation  visits  were  reimbursed.  Forty-three  percent  (85  out  of  188)  of  faculty  associates 
took  orientation  tr^  at  an  average  cost  of  $449.  By  contrast,  in  1993,  58  %  of  SFRP  associates 
elected  to  take  an  orientation  visits  at  an  average  cost  of  $685;  that  was  the  highest  percentage  of 
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associates  opting  to  take  an  orientation  trip  since  RDL  has  administered  the  SRP,  and  the  highest 
average  cost  of  an  orientation  trip. 


Program  participants  submitted  biweekly  vouchers  countersigned  by  their  laboratory  research  focal 
point,  and  RDL  issued  paychecks  so  as  to  arrive  in  associates’  hands  two  weeks  later. 

This  is  the  third  year  of  using  direct  deposit  for  the  SFRP  and  GSRP  associates.  TTie  process  went 
much  more  smoothly  with  respect  to  obtaining  required  information  from  the  associates,  about  15%  of 
the  associates’  information  needed  clarification  in  order  fw  direct  deposit  to  properly  function  as 
opposed  to  7%  from  last  year.  The  remaining  associates  recehed  their  stipend  and  expense  payments 
via  checks  sent  in  the  US  mail. 

HSAP  program  participants  were  considered  actual  RDL  employees,  and  their  respective  state  and 
federal  income  tax  and  Social  Security  were  withheld  from  their  paychecks.  By  the  nature  of  their 
independent  research,  SFRP  and  GSRP  program  participants  were  considered  to  be  consultants  or 
independent  contractOTS.  As  such,  SFRP  and  GSRP  associates  were  responsible  for  their  own  income 
taxes.  Social  Security,  and  insurance. 

8.  CONTENTS  OF  THE  1998  REPORT 

The  complete  set  of  rq)orts  for  the  1998  SRP  includes  this  program  management  report  (Volume  U 
augmented  by  fifteen  volumes  of  final  research  reports  by  the  1998  associates,  as  indicated  below: 


1998  SRP  Final  Report  Volume  Assignments 


1  LABORATORY 

SFRP 

GSRP 

HSAP 

Ajmstroi^ 

2 

7 

12 

niiUips 

3 

8 

13 

Rook 

4 

9 

14 

Wright 

5A,  5B 

10 

15 

AEDC,  ALCs,  USAFA,  WHMC 

6 

11 

7 


APPENDIX  A  -  PROGRAM  STATISTICAL  SUMMARY 


A.  Colleges/Uimersities  Represented 

Selected  SFRP  associates  represented  169  different  colleges,  universities,  and  institutions, 
GSRP  associates  represented  95  different  colleges,  universities,  and  institutions. 

B.  States  Represented 

SFRP  -A{^licants  came  from  47  states  plus  Washington  D.C.  Selectees  rqjresent  44  states. 
GSRP  -  Applicants  came  from  44  states.  Selectees  represent  32  states. 

HSAP  -  Applicants  came  from  thirteen  states.  Selectees  represent  nine  states. 


Total  Number  of  Participants  I 

1  SFRP 

85 

GSRP 

40 

1  HSAP 

88 

TOTAL 

213 

j  Degrees  Represented 

SFRP 

GSRP 

TOTAL 

1  Doctoral 

83 

0 

83 

1  Master's 

1 

3 

4 

Bachelor's 

0 

22 

22 

TOTAL 

186 

25 

109 

.\-l 


1  SFRP  Academic  Tides  | 

1  Assistant  Professor 

36  I 

Associate  Professor 

34  1 

Professor 

15  1 

Instructor 

0  1 

Chairman 

0  1 

Visiting  Professor 

0  1 

Visiting  Assoc.  Prof. 

0 

j  Research  Associate 

0 

1  TOTAL 

85 

Source  of  Learning  About  the  SRP 


Category 

Applicants 

Seleaees 

Ai^lied/participated  in  prior  years 

111 

47 

CoUeagne  familiar  with  SRP 

104 

24 

Brochure  mailed  to  institution 

101 

21 

Contact  with  Air  Force  laboratory 

101 

39 

IEEE  Spectrum 

12 

1 

BIIHE 

4 

0 

Other  source 

117 

30 

TOTAL 


616 


162 


APPENDIX  B  -  SRP  EVALUATION  RESPONSES 


1.  OVERVIEW 

Evaluations  were  completed  and  returned  to  RDL  by  four  groups  at  the  completion  of  the  SRP.  The 
number  of  respondents  in  each  group  is  shown  below. 


Table  B-1.  Total  SRP  Evaluations  Received 


Evaluation  Group 

Re^nses 

SFRP  &  GSRPs 

100 

HSAPs 

75 

USAF  Laboratory  Focal  Points 

84 

USAF  Laboratory  HSAP  Mentors 

6 

All  groups  indicate  unanimous  enthusiasm  for  the  SRP  experience. 


The  summarized  recommendations  for  program  improvement  from  both  associates  and  laboratory 
personnel  are  listed  below: 


A.  Better  pr^)aration  on  the  labs’  part  prior  to  associates’  arrival  (i.e.,  olfice  space, 
computer  assets,  clearly  defined  scope  of  work). 

B.  Faculty  Associates  suggest  higher  stipends  for  SFRP  associates. 

C.  Both  HSAP  Air  Force  laboratory  mentors  and  associates  would  like  the  summer  tour 
extended  from  the  current  8  weeks  to  either  10  or  1 1  weeks;  the  groq)s  state  it  takes  4- 
6  weeks  just  to  get  high  school  students  up-to-speed  on  what’s  going  on  at  laborax>ry. 
(Note:  this  same  argument  was  used  to  raise  the  faculty  and  graduate  student 
participation  time  a  few  years  ago.) 
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2.  1998  USAF  LABORATORY  FOCAL  POINT  (LFP)  EVALUATION  RESPONSES 


The  summarized  results  listed  below  are  from  the  84  LFP  evaluations  received. 
1.  LFP  evaluations  received  and  associate  preferences: 


Table  B-2.  Air  Force  LFP  Evaluation  Responses  (By  Type) 


How  Many  Associates  Would  You  Prefer  To  Get ' 

> 

(%  Respmise) 

SFRP 

GSRP  (w/Univ  Professor) 

GSRP  (w/o  Univ  Professor) 

Lab 

Evals 

Reev’d 

0 

1 

2 

3+ 

0 

1 

2 

3+ 

0 

1 

2 

3+ 

AEDC 

0 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

WHMC 

0 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

AL 

7 

28 

28 

28 

14 

54 

14 

::8 

0 

86 

0 

14 

0 

USAFA 

1 

0 

100 

0 

0 

100 

0 

0 

0 

0 

100 

0 

0 

PL 

25 

40 

40 

16 

4 

88 

12 

0 

0 

84 

12 

4 

0 

RL 

5 

60 

40 

0 

0 

80 

10 

0 

0 

100 

0 

0 

0 

WL 

46 

30 

43 

20 

6 

78 

17 

4 

0 

93 

4 

2 

0 

Total 

84 

32% 

50% 

13% 

5% 

80% 

11% 

6% 

0% 

73% 

23% 

4% 

0% 

LFP  Evaluation  Summary.  The  summarized  responses,  by  laboratory,  are  listed  on  the  following 
page.  LFPs  were  asked  to  rate  the  following  questions  on  a  scale  from  1  (below  average)  to  5  (above 
average). 

2.  LFPs  involved  in  SRP  associate  ^plication  evaluation  process: 

a.  Time  available  for  evaluation  of  applications: 

b.  Adequacy  of  ^plications  for  selection  process: 

3.  Value  of  orientation  trips: 

4.  Length  of  research  tour: 

5  a.  Benefits  of  associate's  work  to  laboratory: 
b.  Benefits  of  associate’s  work  to  Air  Force: 

6.  a.  Enhancement  of  research  qualifications  for  LFP  and  staff: 

b.  Enhancement  of  research  qualifications  for  SFRP  associate: 

c.  Enhancement  of  research  qualifications  for  GSRP  associate: 

7.  a.  Enhancement  of  knowledge  for  LFP  and  staff: 

b.  Enhancement  of  knowledge  for  SFRP  associate: 

c.  Enhancement  of  knowledge  for  GSRP  associate: 

8.  Value  of  Air  Force  and  university  links: 

9.  Potential  for  future  collaboration: 

10.  a.  Your  working  relationshp  with  SFRP: 
b.  Your  working  relationship  with  GSRP: 

11.  Expenditure  of  your  time  worthwhile: 

(Ck)ntinued  on  next  page) 
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12.  Quality  of  program  literature  for  associate: 

13.  a.  Quality  of  RDL’s  communications  with  you: 

b.  Quality  of  RDL's  communications  with  associates: 

14.  Overall  assessment  of  SRP: 


Table  B-3.  Laboratory  Focal  Point  Reponses  to  above  questions 


AEDC 

AL 

USAFA 

PL 

RL 

WL 

#  Evals  Reev  ’d 

7 

1 

14 

5 

0 

46 

Question  # 

2 

- 

86  % 

0  % 

88  % 

- 

85  % 

2a 

_ 

4.3 

n/a 

3.8 

- 

3.6 

2b 

- 

4.0 

n/a 

3.9 

4.5 

4.1 

3 

- 

4.5 

n/a 

4.3 

4.3 

- 

3.7 

4 

- 

4.1 

4.0 

4.1 

4.2 

- 

3.9 

5a 

- 

4.3 

4.3 

4.6 

- 

4.4 

5b 

- 

4.5 

n/a 

4.2 

4.6 

- 

4.3 

6a 

- 

4.5 

5.0 

4.0 

4.4 

- 

4.3 

6b 

- 

4.3 

n/a 

4.1 

5.0 

- 

4.4 

6c 

- 

3.7 

5.0 

3.5 

5.0 

- 

4.3 

7a 

- 

4.7 

5.0 

4.0 

4.4 

- 

4.3 

7b 

- 

4.3 

n/a 

4.2 

5.0 

- 

4.4 

7c 

- 

4.0 

5.0 

3.9 

5.0 

- 

4.3 

8 

- 

4.6 

4.0 

4.5 

4.6 

- 

4.3 

9 

- 

4.9 

5.0 

4.4 

4.8 

- 

4.2 

10a 

- 

5.0 

n/a 

4.6 

4.6 

- 

4.6 

10b 

- 

4.7 

5.0 

3.9 

5.0 

- 

4.4 

11 

- 

4.6 

5.0 

4.4 

4.8 

- 

4.4 

12 

- 

4.0 

4.0 

4.0 

4.2 

- 

3.8 

13a 

- 

3.2 

4.0 

3.5 

3.8 

- 

3.4 

13b 

- 

3.4 

4.0 

3.6 

4.5 

- 

3.6 

14 

- 

4.4 

5.0 

4.4 

4.8 

- 

4.4 
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3.  1998  SERF  &  GSRP  EVALUATION  RESPONSES 


The  summarized  results  listed  below  are  from  the  120  SFRP/GSRP  evaluations  received. 

Associates  were  asked  to  rate  the  following  questions  on  a  scale  from  1  (below  average)  to  5  (above 
average)  -  by  Air  Force  base  results  and  over-all  results  of  the  1998  evaluations  are  listed  after  the 
questions. 

1 .  The  match  between  the  laboratories  research  and  your  field; 

2.  Your  working  relationship  with  your  LFP: 

3.  Enhancement  of  your  academic  qualifications: 

4.  Enhancement  of  your  research  qualifications: 

5.  Lab  readiness  for  you;  LFP,  task,  plan: 

6.  Lab  readiness  for  you:  equipment,  supplies,  facilities: 

7.  Lab  resources: 

8.  Lab  research  and  administrative  support: 

9.  Adequacy  of  brochure  and  associate  handbook: 

10.  RDL  communications  with  you: 

11.  Overall  payment  procedures: 

12.  Overall  assessment  of  the  SRP; 

13.  a.  Would  you  apply  again? 

b.  Will  you  continue  this  or  related  research? 

14.  Was  length  of  your  tour  satisfactory? 

15.  Percentage  of  associates  who  experienced  difficulties  in  finding  housing: 

16.  Where  did  you  stay  during  your  SRP  tour? 

a.  At  Home: 

b.  With  Friend: 

c.  On  Local  Economy; 

d.  Base  Quarters: 

17.  Value  of  orientation  visit: 

a.  Essential: 

b.  Convenient: 

c.  Not  Worth  Cost: 

d.  Not  Used: 

SFRP  and  GSRP  associate’s  responses  are  listed  in  tabular  format  on  the  following  page. 
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Table  B4.  1997  SFRP  &  GSRP  Associate  Responses  to  SRP  Evaluation 


AnoU  1 

Brooks 

Edwirds 

Kgfci 

Griffis 

Hanscom 

lOrtlftiid  1 

LftckiMid 

Rofabis  1 

WPAfB  j 

•mage 

# 

res 

■ 

48 

< 

14 

31 

19 

m 

1 

10 

257 

1 

KEI 

Mfm 

ESM 

kei 

43 

gra 

■a 

EH 

BB 

KSB 

mwm 

2 

i^l 

mm 

eem 

msm 

4.7 

ETil 

. 

El 

Bi 

4.8 

3 

43 

msM 

wMfm 

1^39 

43 

42 

43 

mm 

5AJ 

5.0 

43  1 

43 

KEI 

4 

43 

Ka 

EO 

mwm 

4.4 

43 

mm' 

IBS 

4.0 

KEB 

43 

mm 

5 

43 

43 

33 

4.8 

KEI 

43 

43 

42 

BK 

5.0 

KEB 

mm 

6 

43 

43 

msM 

kei 

43 

gfil 

3.8 

5.0 

3.8 

43 

7 

43 

msM 

"4X1 

4.8 

43 

43 

43 

KIK 

BK 

B1 

43 

43 

8 

43 

EEl 

kei 

43 

43 

MMM 

B1 

KSB 

43 

kei 

9 

msm 

43 

43 

ESM 

43 

BB 

B1 

■SB 

43 

43 

10 

43 

kei 

msm 

eei 

4.1 

KOI 

gfii 

43 

BB 

LirSj 

KOI 

KEB 

43 

11 

3.8 

MSM 

gfil 

3.9 

4.1 

giii 

Kll 

BB 

B1 

K&fl 

KEB 

BB 

12 

msm 

rxn 

EH 

43 

4.9 

ESI 

mm 

BB 

LiU 

mm 

1  43  J 

EE^I  1 

Numbers  below  are  percentages 

13ft 

83 

90 

83 

93 

87 

75 

100 

81 

100 

100 

100 

86 

87 

i3b 

100 

89 

83 

100 

94 

98 

100 

94 

100 

100 

100 

94 

93 

96 

100 

90 

87 

100 

92 

100 

100 

70 

84 

88 

17 

6 

33 

20 

76 

33 

25 

100 

20 

8  1 

39 

1 

26 

17 

9 

38 

23 

33 

4 

- 

- 

- 

30 

EE] 

100 

33 

. 

40 

- 

8 

- 

- 

- 

- 

36 

2 

16c 

. 

41 

83 

40 

62 

69 

67 

96 

100 

100 

64 

68 

16d 

- 

- 

- 

- 

- 

- 

- 

BB 

- 

17t 

33 

100 

17 

50 

14 

67 

39 

- 

IB! 

40 

31 

35 

17b 

• 

21 

. 

17 

10 

14 

- 

24 

- 

lEB 

20 

16 

16 

17c 

. 

- 

- 

10 

7 

- 

- 

- 

L3_ 

- 

2 

3 

17d 

100 

46 

- 

66 

30 

69 

33 

37 

100 

1_ 

40 

51 

46 
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4.  1998  USAF  LABORATORY  HSAP  MENTOR  EVALUATION  RESPONSES 

Not  enough  evaluations  received  (5  total)  from  Mentors  to  do  useful  summary. 
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5.  1998  HSAP  EVALUATION  RESPONSES 


The  sununarized  results  listed  below  are  from  the  23  HSAP  e\aluations  received. 

HSAP  apprentices  were  asked  to  rate  the  following  questions  on  a  scale  from 
1  (below  average)  to  5  (above  average) 

1 .  Your  influence  on  selection  of  topic/type  of  work. 

2.  Working  relationship  with  mentor,  other  lab  scientists. 

3.  Enhancement  of  your  academic  qualifications. 

4.  Technically  challenging  work. 

5.  Lab  readiness  for  you:  mentor,  task,  work  plan,  equipment. 

6.  Influence  on  your  career. 

7.  Increased  interest  in  math/science. 

8.  Lab  research  &  administrative  support. 

9.  Adequacy  of  RDL’s  Apprentice  Handboc^  and  administrative  materials. 

10.  Responsiveness  of  RDL  communications. 


11.  Overall  payment  procedures. 

12.  Overall  assessment  of  SRP  value  to  you. 

13.  Would  you  apply  again  next  year? 

Yes 

(92 

%) 

14.  Will  you  pursue  future  studies  related  to  diis  research? 

Yes 

(68 

%) 

15.  Was  Tour  length  satisfactory? 

Yes 

(82 

%) 

Arnold 

Bnx^ 

Edvi'ards 

Eglin 

GrifBss 

Hanscom 

Kirtland 

Tyndall 

WPAFB 

Totals 

# 

5 

19 

7 

15 

13 

2 

7 

5 

40 

113 

resp 

1 

2.8 

3.3 

3.4 

3.5 

3.4 

3.2 

3.6 

3.6 

3.4 

2 

4.4 

4.6 

4.5 

4.8 

4.6 

4.4 

4.0 

4.6 

4.6 

3 

4.0 

4.2 

4.1 

4.3 

4.5 

EH 

4.3 

4.6 

4.4 

4.4 

4 

3.6 

3.9 

4.0 

4.5 

4.2 

4.6 

3.8 

4.3 

■a 

5 

4.4 

4.1 

3.7 

4.5 

4.1 

El 

3.9 

3.6 

3.9 

El 

6 

3.2 

3.6 

3.6 

4.1 

3.8 

3.3 

3.8 

3.6 

El 

7 

2.8 

4.1 

4.0 

3.9 

3.9 

WSM 

3.6 

4.0 

El 

3.9 

8 

3.8 

4.1 

4.0 

4.3 

4.0 

4.3 

3.8 

MM 

4.2 

9 

4.4 

3.6 

4.1 

4.1 

3.5 

4.0 

3.7 

3.8 

10 

4.0 

3.8 

4.1 

3.7 

4.1 

3.8 

3.8 

11 

4.2 

4.2 

3.7 

3.9 

3.8 

3.7 

2.6 

3.7 

3.8 

12 

4.0 

4.5 

4.9 

4.6 

4.6 

Bl 

4.6 

4.2 

4.3 

4.5 

1  Numbers  below  are  percenta 

ges 

13 

60% 

95% 

100% 

100% 

85% 

100% 

100% 

100% 

90% 

92% 

14 

20% 

80% 

71% 

80% 

54% 

100% 

71% 

80% 

65% 

68% 

15 

100% 

70% 

71% 

100% 

100% 

50% 

86% 

60% 

80% 

82% 

B-7 


MANY-BODY  THEORY  OF  QUANTUM-WELL  GAIN  SPECTRA 


Dr.  Lok  C.  Lew  Yan  Voon 
Assistant  Professor 
Department  of  Physics 


Worcester  Polytechnic  Institute 
100  Institute  Road 
Worcester,  MA  01609 


Final  Report  for: 

Summer  Faculty  Research  Program 
Wright-Patterson  Research  Site 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base,  DC 
and 

Wright-Patterson  Research  Site 


July  1998 


18-1 


MANY-BODY  THEORY  OF  QUANTUM- WELL  GAIN  SPECTRA 


Dr.  Lok  C.  Lew  Yan  Voon 
Assistant  Professor 
Department  of  Physics 
Worcester  Polytechnic  Institute 


Abstract 


The  aim  of  this  summer  project  is  to  develop  a  theory  and  computer  code  for  the  group  of 
Dr.  John  Loehr  in  the  Sensors  Directorate  to  assist  them  in  analyzing  fine  details  of  the  gain 
spectra  of  quantum- well  laser  diodes  being  measured.  Such  an  in-house  capability  should  help  the 
Air  Force  Research  Laboratory  in  developing  new  types  of  semiconductor  lasers.  We  have  critically 
studied  the  different  theoretical  models  of  material  gain  (particularly  with  respect  to  the  many- 
body  effects)  currently  available  in  the  literature,  developed  a  computer  code  for  modelling  the 
gain  at  a  level  comparable  to  the  most  advanced  work  reported  so  far,  and  uncovered  deficiencies 
in  current  theories. 
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MANY-BODY  THEORY  OF  QUANTUM- WELL  GAIN  SPECTRA 


L.  C.  Lew  Yan  Voon 


1  Introduction 

The  Sensors  Directorate  of  the  Air  Force  Research  Laboratory  is  currently  investigating  the  use  of 
diode  lasers  operating  at  high  temperatures  (w  150  C)  and  high  modulation  frequencies  (w  10  GHz) 
to  replace  the  electrical  back-plane  of  an  aircraft  with  an  optical  back-plane.  High-temperature 
operation  is  essential  because  cooling  the  laser  requires  energy  and  space,  which  are  both  very 
limited  onboard  an  aircraft.  Due  to  the  importance  of  semiconductor  lasers  to  the  Air  Force 
mission,  we  have  aimed  to  bring  our  own  contribution  to  the  development  of  a  theory  of  material 
gain.  This  will  allow  the  Air  Force  researchers  to  have  direct  access  to  a  sophisticated  computer 
code  in  order  to  design  and  characterize  new  semiconductor  quantum  well  (QW)  lasers.  Our  first 
aim  WEIS  to  reproduce  current  models  and  test  them  at  high  temperatures.  In  the  process,  we  have 
also  uncovered  extensions  to  those  models  which  we  will  be  investigating  in  future  work. 

2  Methodology 

The  fundamental  property  of  a  laser  medium  is  the  gain  spectrum.  The  gain  spectrum  is  solely 
determined  by  the  external  conditions  (injection  current,  temperature, . . . ),  the  layer  structure,  and 
the  interacting  system  of  electrons,  photons  and  phonons.  This  is  a  complex  and,  as  yet,  unsolved 
problem.  The  methodology  of  the  current  work  partitions  the  problem  into  three  considerations 
(which  are,  however,  not  independent):  the  dimensionality,  the  electronic  band-structure  problem, 
and  the  interactions  (or  many-body)  problem.  We  call  the  final  solution  to  the  problem  the  many- 
body  gain  (MBG).  In  the  following  section,  we  will  outline  the  microscopic  derivation  of  the  MBG 
expression  based  upon  the  semiconductor  Bloch  equations  (SBE’s).  Our  approach  in  the  Results 
section  is  to  then  start  from  a  simplified  solution  and  to  successively  add  contributions  toward  a 
full  band-structure  and  many-body  expression. 
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3  Many-Body  Theory 

3.1  Derivation  of  equations 


3.1.1  Hamiltonian 


The  starting  point  is  the  second-quantized  Hamiltonian  for  an  interacting  electron  gas  in  a  crystal 
with  the  inclusion  of  the  electron-photon  interaction.  Since  we  are  dealing  with  electrons  in  a 
crystal,  we  explicitly  represent  the  field  operators  in  terms  of  the  orthonormalized  Bloch  functions 
and  the  corresponding  annihilation  operators: 

Ip  —  ^ni/«rk(r)  flniy<r(k).  (ij 

ni/<rk 

We  have  chosen  a  basis  set  of  wave  functions  which  are  labelled  by  a  band  index  n,  a  subband 
index  v,  a  spin  index  a,  and  a  wave  vector  k.  The  wave  function  will  now  be  written  explicit  as  a 
QW  state: 

^nvo-k  (r)  =  e**'  P  ^  /Awo-k  (^)“to(r)  t)^,  (2) 

where  k  is  a  2D  in-plane  wave  vector,  p  an  in-plane  position  vector,  A  the  in-plane  normalization 
area,  f{z)  the  envelope  function,  u,o(r)  the  bulk  zone-center  cellular  function,  and  a  spinor.  The 
Coulomb  and  electron-photon  terms  become 
1 


He  = 


2A 


51/  5Z  5^  *l)  ®n3 1/3  0-2  <^7141/40-1  (k^) 


X  v{niVi(7ik!  -f  q,  n2i'20’2k”  —  q;  nzvzcr^^",  n4i/4<rik^). 


(3) 


with 


t;(nii/io-ik'  -I-  q,  -  q;  nzVzCTzk",  niV^a-ik') 

~  (2eoe6g)^^  ■^"li^o-lk'+q(^)^2»ao-2k"-q(^  )/n3i>3tr2k"(^0/^i/4trik'(^)® 

=  F,(2n)^,(l,2,3,4), 


and 


with 


”1  ”2  <^1 0-2  kq 

VI V2  ^ 


(4) 

0>n2V2cr2^  Q)dnin2 

•  "Eniviffi  (k,  q,  t), 
n2t/2<r2 

(5) 

(z\ 

' niuio-xVy  ) J n2iA2<T2\i--ti 

(z)E(q,Z,t). 

(6) 

*X»2 
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3.1.2  Interband  polarization 

The  mticroscopic  interband  polarization  induced  by  a  coherent  monochromatic  classical  light  field 
is  given  by 

P(t)  =  Jdr  P(r,  t)  =  jdr  (V'^(r,  t)cr^(r,  t)) 

)  ^n2i^<r2k 

(7) 


tj 

n2l^O’3K2 


=  V  (k)dnin27(»ll*'l^l)  n2P2(^2i  k). 

Tl2*^^2 

niVliri 

n2V20’2k 

In  particular,  for  interband  transitions, 


giving 


(8) 


(9) 


pm  =  V  P..a<ri(k)d^/(v*/i(7-i,ci/2<r2,k)  +  Fc.^<ri(k)dc/(ci/i<7-i,t;*/20-2,k) 

'  '  ■  *  Cl^tf2  Vl^tr2 

VI  <ri  L 

i>2<r2K 

=  eoxE(r.  0  =  ^oX  +  c.c.] , 


gtWt 

X=-^  V  P.»i«ri(k)d^/(t;i/i<Ti,ci/2<r2,k). 
V2<r2^ 


3.1.3  Gain 

The  amplitude  gain  and  the  phase  change  are  given  by 

.dS  iu 

where  in,  is  the  background  refractive  index  and  the  phase  change  gives  the  carrier-induced  refractive 
index  change: 

^  =  -1^.  (11) 

rih  q  dz 


(10) 


3.1.4  Equation  of  motion 

Using  the  above  Hamiltonian,  we  can  now  derive  the  equation  of  motion  for  the  polarization 


The  complete  general  result  is 


(k)  =  [Pc.v(k)  -  E^^4k)]P,..(k) 

OZ  ci/'jr'  cv'tr* 

+  X)  X)  *^"1"  ■  +  *!>  *)(“n,i/i<ri  (k  +  q)Oci/'<T'(k)) 

niuitri  q 

It'lO’l  (k  -  q)> 

nit/10‘1  q 

+  ^  X)  X/  {®("i‘'i®‘ik'  +  q,n2J/20-k-q;t;i/<rk,n3i/3<rik') 

»*x”2»»3  k^q^O 
Ui1/2^<^1 

^{^hit/ioi  (k'  +  q)4,^,<r(k  -  q)  ®n3i<3<ri  (k')aci.V'(k)>  (13) 

-w(nit/i<rk  +  q,  n2i/20’ik'  -  q;  nsVaaik',  vvak) 

^  (®ni»/l<7'(k  "I"  q)®n2i^<7’j  (k  ~  q)®n3i>3<7-i  (k  )<lci/<i-'(k)) 

+t;(ci/V'k,  niViCTik!  -  q;  n2V2(Tik',  n^vzor'k  -  q) 

^  (®ii/<r(k)^nii/l<Ti  (k  ~  q)®n2V2<ri  (k  )fl  (k  -  q)) 

-i;(niJ/iorik'  +  q,  ci/V'k;  n2i/20’'k  +  q,  7131/30-1  k') 

^  {®t;i/<T (k)®nii^<7-i  (k  4"  q)fln2«/2o-'  (k  4-  q)a  n3i/^<ri  (k'))}- 

We  now  make  the  Hartree-Fock  (HF)  approximation.  A  key  result  of  the  HF  approximation  is  the 
lack  of  carrier-carrier  scattering.  This  would  have  been  present  in  the  polarization  equation  as  an 
imaginary  polarization  term.  This  can  also  be  seen  in  the  renormalization  of  the  energies  by  a  real 
static  self-energy.  Scattering  terms  will  therefore  have  to  be  obtained  separately.  The  equation  of 
motion  is  then 


UZ  ev  ci/'<r' 

+  XX)  ‘^"1"  ■  +  *!»  0(4i./|<ri  (k  4-  q)ac..V'(k)) 

nii/ioi  q 

“  ^  ^  ^  ^  dcni  ‘E  (^J  (k  ^  q)) 

nit/ioi  q 

■*"  T  S  n2i/2^ik  -  q;  t;i/<rk,  n3i/3(rk  -  q) 

qatO 

^(^n2i^<r(k  ^^nzt/^ox  (k  —  (k)flci/V'(k)) 

--t;(ci/V'k,  nii/i(Tik  -  q;  n2i/2<^ik,  rtzVztr'Vi  -  q) 

X  (4«.a(k)  ^n2i<2<ri  (k)>(  ^hit/ioi  (k-q)  (k-q))}. 
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Equation  (14)  shows  that,  in  the  absence  of  the  Coulomb  potential,  the  polarization  P„m(k)  satisfies 
a  simple  first-order  diflTerential  equation  whose  solution  is  the  well-known  free-CEirrier  polarization. 
However,  when  the  Coulomb  potential  is  retained,  Pnm(k)  is  coupled  to  Pnm(q)  and  to  other 
Pn'm'O^'),  leading  to  a  coupled  set  of  integro-differential  equations.  This  problem  is  extremely 
complicated  and  has  not  even  been  referred  to  in  the  literature.  We  start  by  solving  the  simplified 
problem  considered  by  others  and  which  goes  under  the  name  of  the  semiconductor  Bloch  equations 
(SBE). 


3.1.5  Semiconductor  Bloch  equations 


The  SBE  can  be  obtained  as  approximations  to  our  Eq.  (14).  The  traditional  derivation  (see,  e.g.. 
Ref.  [1])  starts  from  a  simplified  Hamiltonian,  whose  Coulomb  term  consists  only  of  the  same  two 
conduction-band  indices  and  of  the  same  two  valence-band  indices.  It  is  then  straightforwardly 
transformed  into  the  electron-hole  representation.  Using  Eq.  (14),  one  first  uses  the  electric  dipole 
and  two-band  approximations  giving 


*^^Pvc(k)  — 


Ecik)  -  4  Xl  "2k  -  q;  ck,  ngk  -  q)(4,(k  -  q)a„3(k  -  q)) 


njns 

q#0 


-  < 


E„(k)  -  4  X]  -  q)an3  (k  -  q)) 


P«c(k) 


+  (^ck  “  ^vk) 


dc  •  E«(k,  0,  t)  -I-  2  X)  "2*^  “  *1;  "3^  “  q)(43 (k  -  q)a„3 (k  -  q)) 


112^3 

q^O 


The  single-particle  energies  are  then  replaced  by  renormalized  ones,  only  terms  linear  in  the  particle 
densities  are  kept,  and  collision  terms  (which  are  absent  at  the  HF  level)  are  added. 


iR^-(e4k)-e,(k)) 


■fwc(k)  —  ^«k)  ^dc  •  Ee(k,  0,  t) 


(15) 


+  4  t;(ck,  vk  -  q;  wk,  ck  -  q)P,c(q) 


qjtO 


—  ih 


Tk-?vc(k)  X!  Tk,q-kPi;c(q) 
q 


There  are  equivalent  equations  for  the  carrier  distributions  but  we  will  not  be  interested  in  them 
due  to  the  time  scales  involved. 
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3.1.6  Many-body  gain 


We  now  solve  Eq.  (15)  to  obtain  the  gain  formula.  The  polarization  function  can  be  formally 
written  as 

p„(k,0  =  |-ink(»,t')Ai((')  -  STk,,-kilc(q,(')|,  (16) 

where 

£*^k  =  ^  [ec(k)  -  e„(k)] ,  Ak(f)  =  nck(t)  -  n„v(«)  (17) 

nk(z,i)  =  i  dc  •E.(k,0,<)  + 1  J]T;(ck,vk- q;vk,ck- q)P„c(q,«)  •  (18) 

L  J 

We  now  make  the  quasiequilibrium  approximation  for  which  the  carrier  densities  vary  little  with 
time  and  the  rotating-wave  approximation  which  is  valid  at  optical  frequencies  and  consists  in 
dropping  the  integrated  part  of  the  polarization  which  is  not  resonant.  Then 

P„c(k,t)  =  --Ak(t)dc„  •E.(k,0,t)7^^^j— (19) 

+  Z)  -r-^«(ck,uk- qiuk.ck- q) -7k,q_k]  /*  <it'e(‘‘^+T^)(‘'-‘)p„c(q,f'). 

q^O  I  ^  ^  J  ./-oo 

Finally,  one  makes  use  of  the  Fade  approximation.  One  assumes  that  the  polarization  can  be 
iterated  in  terms  of  the  Coulomb  potential  (the  Mott  regime)  and  the  consequent  series  resummed 
using  the  lowest-order  Fade  approximation.  The  zeroth-order  (free-carrier)  polarization  is 


n^c^(k,t) 

-  -^A^(0d„  .  E.(k,  0, 0;,^  _ 

(20) 

where 

,.(0)  /f ^  (*^)  [”ck  (t)  -  n„k  (t)] 

^  Eoh  —  Ci^)  +  * 

(21) 

and  dev  =  e  ■  dev-  The  next  term  is 

=  i:  [vn, + ’'2^,5"]  xf>(«) 

(22) 

=  eg>:L“^(0^e--‘Q(k,t), 

(23) 

where  the  Coulomb  potential 


^"-ql  =  «(ck,  vk  -  q;  uk,  ck  -  q), 
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and  the  Coulomb  enhancement 


One  finally  heis 


a(E)-i^  = 


I4»(k)p{/ci  -  M 


'  dz  2eoehfiV  “  i(u;k  -  +  7k  1  -  eoQ(k)  *)  ’ 

where  the  Fade  approximation  has  been  used. 


(24) 


3.2  Analysis  of  band-structure  and  many-body  effects 

Having  derived  one  form  of  the  gain  equation,  we  would  like  to  identify  a  number  of  features 
present  which  derive  from  the  dimensionality,  the  electronic  dispersion  relation,  and  the  many- 
body  theory.  For  the  dimensionality,  there  are  two  considerations:  ideal  2D  and  quEisi-2D  (q2D). 
For  the  electronic  dispersion,  there  are  a  whole  host  of  issues:  infinite  or  finite  barriers,  parabolic 
bands,  nonparabolicity  or  full  band  structure,  constant  or  fc-dependent  optical  matrix  elements.  The 
many-body  theory  gives  rise  to  pleisma  screening.  Coulomb  potential,  band-gap  renormalization. 
Coulomb  enhancement,  carrier-carrier  scattering,  and  carrier-phonon  scattering.  Recall  that  these 
considerations  are  not  all  independent.  Indeed,  we  will  be  mainly  interested  in  how  the  first  two 
affect  the  last  one. 


3.2.1  Finite  barriers 


The  simplest  model  of  electron  confinement  is  that  due  to  an  infinite  barrier,  for  which  there  are 
exact  solutions  for  the  confinement  energies  of  a  single  QW: 


E,= 


(25) 


A  much  better  fit  to  transition  energies  can  be  obtained  by  root  finding  the  solutions  to  the 
dispersion  relation  for  a  finite-barrier  QW: 


j=-, 


(26) 


where 


k  = 


2m:i,E 


l2ml{Vi,-E) 


and  ml,  {ml)  is  the  well  (barrier)  effective  mass  and  T4  is  the  potential  barrier  height. 


18-9 


3.2.2  Nonparabolicity 

We  have  introduced  nonparabolicity  into  the  conduction  band  by  using  a  Kane-type  modification 
of  the  effective  mass  into  an  energy-dependent  one: 

m*(B)  _  E,  ^  W 

m*  Eg  +  E’  2m*{Ey 

giving  the  electron  energy  2is 

Em. 

2  ■ 

3.2.3  Optical  matrix  elements 


E  = 


-1  +  t  1  + 


4 

Yg  2m* 


As  a  first  approximation,  one  can  use  bulk  zone-center  matrix  elements  multiplied  by  a  polariza¬ 
tion  factor  for  the  selection  rules  involving  heavy-hole  and  light-hole  states,  and  an  overlap  factor 
between  the  envelope  functions.  Within  the  infinite-barrier  approximation,  the  overlap  factor  leads 
to  the  selection  rule  Av  =  0.  The  dipole  matrix  element  is  then  given  by 


I  <?ull.(k)(l-|?  ‘P)  c-hh 
1  C,l.(k)(5  +  l?-t|’)  <=-lh 


where  e  is  the  unit  polarization  vector  and  k  the  growth  direction. 

One  can  also  assume  constant  bulk  matrix  elements  or  introduce  a  Kane-type  k  dependence: 


^mn(k)  —  dmn(O) 


^mn(O) 

£?mn(k)' 


(30) 


3.2.4  2D  vs.  quasi-2D 


One  of  the  places  where  current  theories  is  most  lacking  is  in  a  proper  treatment  of  the  quasi- 2D 
nature  of  the  theory.  Even  the  latest  and  most  sophisticated  published  work  in  the  field  do  not 
appear  to  consider  the  quasi-2D  nature  fully  and  consistently  [1,  2,  3,  4,  5].  The  basic  difference 
between  a  pure  2D  and  a  q2D  theory  of  the  electrons  lies  in  the  neglect  of  the  envelope  function  and 
in  the  subband  structure  of  the  band  structure.  While  some  workers  have  taken  the  subband  nature 
of  the  electrons  into  consideration  in  computing  the  single-particle  energies,  they  nonetheless  still 
used  a  2D  form  for  the  Coulomb  potential. 
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(31) 


We  here  give  the  exzict  2D  form  factor  for  the  infinite-barrier  case: 

_  •>  r  f  p+m— n,0  "I*  m+n,0  ~  ^/+p+m— n,0  ~  <^J+p— m+n,0 

F,(lmnp)  =  - [(m  -  - 

p+m+n,0  ~l~  p— m— n,0  ~  ^t+p+m+n,0  ~  ^l+p—m—n,0  1 

[(m  +  n)^7r^  -t-  q^Ll,]  J 

-  Ll,{lmnp) 

1  ^  (_ljm+n+z+p  _  (_lJn»+nJg-gXia, 

^  [(m  +  n)27r2  +  q^Ll,][{m  -  n)H^  -1-  q^Ll][{l  -b  q^Ll][{l  -  +  q^L^]  ’ 

=  Iq{lmnp)  +  Hq{lmnp).  (32) 

This  result  had  previously  been  given  by  Ell  and  Haug  [6]. 

3.2.5  Quasi- Fermi  levels 

We  assume  rapid  thermalization  within  the  conduction  and  the  valence  bands  such  that  the  carrier 
distributions  can  be  described  by  quasi-Fermi-Dirac  distributions  and  one  quzisi-Fermi  level  (or 
chemical  potential)  for  each  band. 

The  simplest  model  is  for  a  perfect  2D  system  with  parabolic  bands.  There  is  an  exact  expression 
for  the  chemical  potential  if  there  is  only  one  occupied  subband: 


Pp  —  In 


-1 


(33) 


where  rii  is  the  areal  carrier  density.  If  there  are  occupied  subbands,  the  chemical  potential  has 
to  be  found  from  the  root  of  the  following  equation: 

m*  .  r.  .  s(^-AE^io))] 


n  =  y;iln[n- 
^  irh^P  L 


(34) 


where  n  is  the  net  areal  carrier  density  in  the  given  band,  and  AE„(0)  is  the  energy  of  the  bottom 
of  each  subband  v  with  respect  to  the  bottom  of  the  lowest  subband  (the  reference  energy  for  the 
definition  of  p).  In  the  electron-hole  picture,  the  Fermi-Dirac  distribution  is 

1 


/.(k) 


e/3(B.(k)+AB.(0)-p)^.i- 


(35) 


3.2.6  Plasma  screening 

In  order  to  incorporate  correlation  effects,  one  can  screen  the  Coulomb  potential  using  the  Lindhard 
dielectric  function.  In  practical  terms,  one  often  make  use  of  the  static  plasmon-pole  approximation: 


^(q) 


=  1 _ tL 


o;;: 


(36) 
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where 


2  2 
u)  z=  ii; 


■)■ 


(37) 

(38) 

(39) 


«  =  .EW^  =  «EW^(i- 

where  Ly^  is  the  well  width,  the  areal  czirrier  density  in  subband  v,  the  total  number  of 
subbands  in  the  problem,  and  C  a  constant  usually  taken  between  1  and  4.  These  screening 
parameters  have  been  derived  for  the  ideal  2D  and  parabolic  band  cases.  This  should  be  a  suitable 
approximation. 


3.2.7  Hartree-Fock  self  energy 

We  obtain  the  following  exchange  energy  within  the  HF  approximation: 

=  “T  IZ  n2i/2«Tk  -  q;  wi/erk,  n2i/20-k  -  q)/nj.^a(k  -  q).  (40) 

a.qjto 

Note  that  all  the  spin  indices  are  the  same,  confirming  that  it  is  the  exchange  contribution.  This  is 
a  generalization  of  the  exchange  self-energy  expression  (due  to  the  summation  over  the  2  quantum 
numbers)  found  in  the  many-body  gain  literature,  although  we  have  found  an  identical  result  in 
the  band-gap  renormalization  work  of  Ryan  and  Reinecke  [7].  In  practice,  one  also  screens  the 
Coulomb  potential. 

3.2.8  Coulomb  hole  self-energy 

At  the  HF  level,  the  Coulomb  hole  (CH)  self  energy  is  introduced  ad  hoc  to  account  for  electron 
correlation  (screening)  due  to  the  Coulomb  interaction.  It  is  defined  to  be 

A£;CH  =  ^  ^  [V,{q)  -  V{q)]  =  V,{r  =  0)  -  F(r  =  0).  (41) 

q#o 

In  Fourier  space,  it  is  seen  to  represent  the  diflference  between  a  screened  Coulomb  energy  and  the 
unscreened  one;  it  is  thus  a  correction  to  the  HF  theory  which  does  not  contain  screening  effects. 
In  real  space,  one  sees  that  it  is  the  difference  in  the  self-energy  of  a  charge  interacting  with  itself 
due  to  the  screened  and  unscreened  Coulomb  potential;  of  course,  ecich  self-energy  is  divergent  but 
the  difference  is  well-defined. 
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3.2.9  Scattering  terms 


Carrier- carrier  scattering 

Tk  =  |{rf+r|:“)  (42) 

i  S  £2TV,yi'',i'"';K",i')i((„(k)  +  v(k')-e^(k  +  q)-e^»(k'-q)) 

^  [/i/'k+q(l  “  /i/^kOA'^'k'— q  "f*  (1  “  /i/'k+q)  A'^k'Cl  "  /i/'"k'— q)]  j 

Tk.,  =  s  (re,  +  ru,)  (44) 

X  [(1  “  /i/k)(l  /i/"k')/*^'''k'— q  "I"  /i/k/i/''k'(^  “  /i/'"k'— q)]  • 

These  results  are  taken  from  Ref.  [8]. 

Carrier- LO  phonon  scattering 

We  consider  the  Prohlich  mechanism  between  the  carriers  and  the  3D  phonons: 

rLo,k  =  Z)  ;r  -  ^>'0^  -  q)  -  -q) 

^tQV  \fcoo  c,/  g|| 

+iV'ph(5(e„(k)  -  €„(k  4-  q)  +  fta;,)F''(k,  q)] ,  (46) 

where 

J=”(k,q)=  E  //**'/;i'(*)a‘(*')4k-,(4')4l-,(k^)'"’"''‘'''-  (47) 

»1— *4 

3.3  Results 

We  now  list  some  of  the  key  results.  Our  canonical  system  is  a  100  A  GaAs  infinite-barrier  quan¬ 
tum  well,  using  a  two-band  parabolic  approximation  with  a  constant  linewidth  broadening  factor. 
External  conditions  are  a  temperature  of  300K  and  a  C2«’rier  density  of  3  X  10^®cm“^. 

•  Band-gap  renormalization  redshifts  the  gain  spectrum. 

The  SX  contribution  is  Jfe-dependent  and  smaller  than  the  CH  energy  [Fig.  1,2].  The  BGR  is 
reduced  if  form  factors  are  used  for  the  Coulomb  matrix  elements.  With  the  q2D  form  factor 
within  the  infinite-barrier  approximation,  the  BGR  is  of  the  order  of  40-50  meV. 

•  The  many-body  gain  is  enhanced  over  the  free-carrier  value  [Fig.  3]. 

This  is  predominantly  due  to  the  Coulomb  enhancement  factor. 
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•  A  nonparabolic  conduction  band  increases  the  high-energy  slope  of  the  gain  spectrum. 

•  Use  of  a  constant  optical  matrix  element  (instead  of  a  ife  dependent  one)  leads  to  a  slight  lower 
gain,  the  effect  being  more  pronounced  for  the  many-body  gain  [Fig.  3]. 

•  The  intrasubband  carrier-carrier  scattering  is  dominated  by  the  h-h  scattering  over  e-e  and 
both  are  reduced  by  the  form  factors.  They  add  up  to  «  15  meV  [Fig.  4]. 

•  The  intrasubband  e-LO  phonon  Frohlich  interaction  contributes  w  5  meV  to  the  linewidth  [Fig.  5]. 


4  Summary 

Using  an  infinite-barrier  solution  to  the  electron  problem,  we  have  evaluated  the  many-body  gain 
of  quantum-well  structures.  This  has  allowed  a  detailed  understanding  of  the  influence  of  various 
factors  on  the  many-body  efects  and  on  the  gain. 

A  number  of  issues  were  identified  for  further  study: 

•  Incorporate  a  realistic  band  structure. 

We  are  currently  adapting  a  four-band  band-structure  code  and  concurrently  upgrading  our 
gain  program.  Once  these  tasks  are  completed,  we  will  combine  the  two. 

•  Continuum  states. 

High-temperature  operation  of  the  laser  will  require  consideration  of  the  continuum  states. 
This  would  constitute  novel  work. 

•  Fully-quantized  Hamiltonian. 

This  would  constitute  a  generalization  of  the  formalism  presented  in  the  current  report  and 
would  allow  one  to  study  other  effects  at  the  same  level  (e.g.,  spontaneous  emission  and 
phonons) . 

•  Electron-phonon  interaction. 

We  have  here  only  considered  optical  phonons  with  flat  dispersion  relations.  It  is  known  that 
the  electron-phonon  scattering  rate  is  modified  if  realistic  phonon  dispersions  are  used. 
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Figure  3:  Free-carrier  gain  and  many-body  gain  (using  a  two-term  iteration  and  Fade  resummation) 
with  constant  and  jk-dependent  optical  matrix  elements. 
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Influence  of  form  factor  on  e-LO  phonon  scattering 


Figure  5;  Influence  of  form  factor  and  wave  vector  on  electron-LO  phonon  scattering. 
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Abstract 


This  report  summarizes  the  result  of  a  two-month  research  effort  in  investigation  of 
georeferencing  in  multiplatform  and  multisensor  based  object  location.  The  environment 
used  in  this  report  can  be  characterized  by  a  set  of  dynamic  platforms  such  as  satellites, 
aircraft,  and  terrestrial  vehicles  with  various  sensors,  for  example,  GPS,  INS,  SAR,  EO, 
and  FLIR.  The  sensors  may  be  installed  on  the  platforms  separately  or  combined.  It  is 
assumed  that  communication  between  the  platforms  is  available,  so  that  real-time 
applications  demanding  coordination  among  the  platforms  can  be  carried  out.  The  object 
to  be  located  is  simplified  as  a  point  on  the  ground.  Data  acquired  by  this  kind  of 
integrated  sensor  system  can  be  characterized  by  its  vast  volume,  comprehensive  spatial 
relationships,  and  associated  temporal  dynamics. 

After  a  brief  introduction,  chapter  2  reviews  mathematical  models  of  least  squares 
adjustment  and  gives  a  comparison  between  two  methods  used  in  modeling  sensor 
uncertainties.  A  conceptual  model  of  dynamic  object  location  by  a  multiplatform  and 
multisensor  system  is  introduced  in  chapter  3.  In  chapter  4,  design  of  an  experiment  using 
optical  images,  a  SAR  image,  and  a  terrain  model  is  given  to  test  the  concept  in  chapter  3. 
Finally,  conclusions  are  drawn  in  chapter  5. 
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A  STUDY  ON  REFERENCING  ISSUES  IN  MULTIPLATFORM  AND 
MULTISENSOR  BASED  OBJECT  LOCATION 

Rongxing  (Ron)  Li 


1.  Introduction 

This  study  investigates  referencing  issues  related  to  object  location  using  data  such  as 
imagery  and  navigation  data  acquired  by  a  multiplatform  and  multisensor  system.  The 
environment  used  in  this  report  can  be  characterized  by  a  set  of  dynamic  platforms  such 
as  satellites,  aircraft,  and  terrestrial  vehicles  with  various  sensors,  for  example,  GPS,  INS, 
SAR,  EO,  and  FLIR.  The  sensors  may  be  installed  on  the  platforms  separately  or 
combined.  It  is  assumed  that  communication  between  the  platforms  is  available,  so  that 
real-time  applications  demanding  coordination  among  the  platforms  can  be  carried  out. 
The  object  to  be  located  is  simplified  as  a  point  on  the  ground.  Data  acquired  by  this  kind 
of  integrated  sensor  system  can  be  characterized  by  its  vast  volume,  comprehensive 
spatial  relationships,  and  associated  temporal  dynamics. 

In  principle,  one  sensor  or  a  combination  of  sensors  allow  determination  of  the  location 
of  the  object  relative  to  the  sensor(s)  (Moffitt,  and  Mikhail  1980,  Curlander  and 
McDonough  1991,  Layne  and  Blasch  1997).  The  navigation  sensors  add  absolute 
information  into  the  system.  Thus,  the  relative  location  of  the  object  is  referenced  to  a 
ground  coordinate  system  (Li  1996).  Since  various  sensors  have  their  strengths  and 
weakness,  a  well-designed  strategy  is  necessary  to  minimize  the  uncertainty  of  the  object 
location  using  the  given  platforms  and  sensors.  Particularly,  issues  of  individual  sensor 
and  platform  uncertainty,  correlation  between  sensors  and  platforms,  and  final  object 
location  uncertainty  are  concerned. 

This  report  discusses  Least  Squares  models  in  Chapter  2,  concept  of  an  improved 
dynamic  model  in  Chapter  3,  design  of  an  experiment  in  Chapter  4,  and  conclusions  in 
Chapter  5. 


2.  Least  Squares  Model  for  Object  Location 

Suppose  that  1  is  a  vector  representing  observations  such  as  ranges  and  range  rates  from 
SAR  and  image  coordinates  measured  from  electro-optic  or  FLIR  images.  Vector  X 
(state)  contains  all  unknown  parameters  such  as  platform  locations  and  coordinates  of  the 
object.  In  principle,  the  relationship  between  the  observations  and  the  state  can  be 
described  by 

1  +  V  =  F(X)  (1) 

where  V  is  a  vector  of  white  noise  (corrections)  associated  with  the  observation  1. 
Linearization  of  (1)  using  a  Taylor  series  leads  to 
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l  +  V  =  F(Xo)  +  AAX  +  0(AX).  (2) 

Here  X=Xo+AX,  with  Xo  being  the  approximate  value  of  X  and  AX  being  increment 
based  on  Xo.  Matrix  A  is  a  Jacobian  matrix  defined  as 


<5c, 

(3) 


0(AX)  is  higher  order  terms.  Assume  that  AX  is  small  and  we  neglect  0(AX).  The 
observation  equation  can  be  written  as 


V  =  A  AX  +  F(Xo)  - 1, 

(4) 

V-AAX  +  L. 

(5) 

Each  observation  will  make  one  row  in  matrix  equation  (5).  The  uncertainty  of 
observation  f  is  represented  by  its  variance  Similarly  the  covariance  between  h  and  Ij 
is  aij.  The  covariance  matrix  of  the  observation  vector  1  is 


f  2 

CM 

2  \ 

^12 

CT, 

In 

II 

11 

^21 

■ 

■ 

The  weight  matrix  is  defined  as  W=  R''.  Here  is  the  unit  weight  variance. 


W,2  . 

II 

II 
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NJ 

II 

^21 

••  ^2. 

Kl  ■ 

•• 

(6) 


(7) 


If  the  observations  are  uncorrelated,  CTij=0.  Thus,  Zn  is  a  diagonal  matrix,  so  is  W,  with 
Wij=o  (i^’^^j)  and  Wj  = 

The  objective  is  to  estimate  the  unknown  vector  AX  from  the  observations  L  (actually  1) 
by  Equation  (5)  that  is  usually  overdetermined  (more  equations/observations  than 
unknowns).  A  Least  Squares  Adjustment  (LSA)  is  going  to  determine  the  AX  so  that 
V^WV=Sum(WiVi^)  =min.  A  normal  equation  is  built  as 
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or 


(A'^WA)  ax  +  a'^wl  =  0 
NAX+  A'^WL  =  0. 


(8) 


The  LSA  estimate  is 

AX  =  -isr*  A'^WL.  (9) 

The  covariance  matrix  of  AX  is 

Sx  =  CTo^  N*'  =  ao^(A'^WA)-* .  (10) 

There  are  different  forms  of  the  LSA  model  including  some  extended  models,  for 
example,  those  discussed  in  Moffitt  and  Mikhail  (1980),  Leick  (1995),  Mikhail  and 
Grade.  Extension  of  LSA  using  Kalman  filter  for  handling  GPS  data  and  other 
dynamically  collected  data  can  he  found  in  Leack  (1995). 

The  object  to  be  located  has  coordinates  (X,  Y,  Z).  Its  covariance  submatrix,  the  upper- 
left  3X3  submatrix,  can  be  extracted  from  the  covariance  matrix  of  the  state 


We  can  describe  the  location  uncertainty  by  an  ellipsoid  that  is  derived  from  the 
covariance  submatrix  (Mihkail  and  Gracie). 

2.1  General  LS  model 

A  general  LS  model  has  an  observation  equation 

AX  +  BV  =  f,  (11) 

B  is  a  coefficient  matrix  for  V  that  is  unit  matrix  -I  in  Equation  (5).  Assume  that  Ve=BV 
and  le=Bl.  Re=BRB^  Equation  (11)  becomes 

AX  +  Ve  =  f. 
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Comparing  to  Equation  (5),  the  LS  solution  is 


X  =  [a^  aWa'  f  \  and 

R^=[A\BRB'^y^j^\ 


2.2  Application  in  object  location 

Suppose  that  we  have  observations  from  n  sensors  (or  one  sensor  measuring  n  times)  to 
locate  an  object.  The  state  vector  is  X  =  (Xt,  Yt,  Zt,  Xsi,  Ysi,  Zsi,  . . Xsn,  Ys„,  Zsn)^. 
Here  we  compare  two  ways  of  using  LS  to  solve  the  problem.  Method  I  splits  the  state 
vector  to  two  vectors,  object  state  Xi  =  (Xt,  Yt,  Zt)  ^  and  sensor  state  X2  =  (Xsi,  Ysi,  Zsi, 
. . .,  Xsn,  Ysn,  Zsn,)  X2  is  then  combined  with  noise  vector  V.  The  observation  equation 
becomes 


(Ai  A2) 


fx,^ 

IxJ 

+  V  =  Ai  Xi  +  (A2 1) 

-  Ai  Xi  +  B 

J 

J 

=  f. 


(12) 


The  general  LS  model  is  applied  to  treat  Xi  as  unknowns  and  (X2  V)^  as  noise.  The  LS 
solution  is 

Xi  =  {A,^  [(A2  I)  R  (A2  I)'’]  Ai }  (A/  [(A2  I)  R  (A2 1)  '^1  f} 

=  [Ai"^  (B  R  B^)-*  Ai]-'  [A/  (B  R  B^)"*  f]. 

Rxi  =  { Ai^  [(A2 1)  R  (A2  I)  Ai }  -* 

=  [Ai''(BRB"')-*A,]-‘.  (13) 

Comments  are:  a)  the  direct  measurements  are  ranges,  range  rates,  and  others,  not  the 
sensor  locations  X2;  b)  the  sensor  state  should  be  treated  as  signals  (unknowns  to  be 
estimated),  instead  of  a  part  of  noise,  the  squared  sum  of  which  is  minimized  by  LSA;  c) 
depending  on  prior  variances  given  to  X2,  the  estimate  of  Xi  may  be  adjusted  either  too 
much  or  less.  We  should  note  that  measurements  are  made  between  Xi  and  X2.  There  is 
usually  correlation  between  them.  Also,  there  should  be  correlation  between  the 
observation  1,  noise  V,  Xi  and  X2.  Any  assumption  of  independence  between  them  needs 
to  be  proven. 

Method  II  proposes  to  treat  both  object  location  and  sensor  locations  as  unknowns  to  be 
estimated.  The  observation  equation  is 


(A,  A2) 


'X, 

X 


2J 


+  V  =  f 


The  LS  solution  is 

X  =  (Xi  X2)^  =  [(A,  A2)''  R-‘  (Ai  A2)]  [(Ai  A2)^  R-‘  f| 


(14) 
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(15) 


Rx- 


Rx. 

V^x,, 


R> 

R 


Xj  ) 


=  [(AiA2)^R-‘(A,  A2)] 


The  covariance  matrix  of  the  object  location  Rxi  is  a  3x3  upper-left  submatrix  in 
Equation  (15).  Unless  someone  can  prove  that  Equations  (13)  and  (15)  are  identical,  the 
estimates  fi'om  Method  I  and  II  will  be  different.  The  advantages  of  Method  II  are  that 
correlations  between  object  and  sensor  locations  are  considered;  sensor  locations  are 
estimated  as  the  results  of  LSA;  and  the  imcertainty  estimated  should  be  more  realistic. 


2.3  An  example 

Laser  ranges  are  used  to  locate  an  object  as  described  in  Figure  1.  Measured  are  ranges  rm 
fi'om  sensor  (Xs,  Ys,  Zs)  to  the  object  at  (X,  Y,  Z),  with  noise  of  Vm- 


Figure  1.  Laser  ranges  for  object  location  (X,  Y,  Z) 


The  observation  equation  is 

■■.+V,.  =  |r-r.|  =  fex-X.)‘  +(Y-Y,)“  +(Z-Z.)T‘ 

=  ULOS,  dX  +  ULOSy  dY  +  ULOS,  dZ  -  ULOS,  dX,-ULOSy  dY, 

-ULOS,dZ,+r,+0(A) 

Here  ULOS,  are  direction  cosines  of  the  unit  vector  of  the  Line  of  Sight  (LOS): 


ULOSx= 


_ (Y-YJ _ 

^(x-x,f+(Y-rf+(z-zf 


ULOSy= 


_ (Y-Y,) _ 

^(X-XJ^+(Y-Yf+(Z-Zf 
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UL0S2=  and 

^(X-XJ^+iY-Yf+iZ-Zf 

r=  ^(X  -X,y  +  (Y-YJ  +  (Z  -ZJ . 


The  linearized  observation  equation  is 


r„-r„=(ULOS,  ULOS^  ULOSz  -ULOSx  -ULOSy  -ULOSJ 


^dX^ 
dY 
dZ 
•^dX, 
dY, 
vdZ„ 


+  (-Y,„). 


Note  that  r„-r„  is  small,  bud  usually  not  zero.  dX,  dY,  dZ,  dXj,  dYs,  and  dZs  are 
unknowns  to  be  estimated.  They  are  signals,  not  noises  in  . 


If  Method  I  would  be  used,  the  observation  equation  should  be: 


fdX^ 

(dxA 

dY, 

dZ, 

r,n-ro=QALOS.  ULOSy  ULOS J 

dY 

+(-ULOS  ^-ULOS  ^ 

-ULOS^-l) 

=  AX  +  BV. 

In  comparison  to  Equation  (13)  for  Method  I, 


^dX^ 

dY 

II 

X 

,  v  = 

dY, 

,dZ, 

dZ 

V 

k  fm  y 

The  solution  is  then 


X  =  [a^  f\ 

R,=[A\BRB^rA\\ 


To  compare  with  Method  I,  we  reformat  the  observation  equation  as 
(A  B)  (Xi  Xa)'^  +  V  =  (AB)X  +  V  =  f 


(16) 
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Method  II  gives  a  LS  estimate  and  covariance  matrix  of  X 


X  =  [(A  Bf  R‘‘(A  B)]-’  [(A  Bf  R-‘f] 

Rx  =  [(ABf  R-'(AB)]-'  (17) 

Note  that  matrix  B  in  Equations  (15)  and  (17)  are  different.  The  upper-left  3x3  submatrix 
is  the  covariance  matrix  Rx  of  the  object  location  Xi.  This  submatrix  is  usually  not  the 
same  of  Rx  in  Equation  (16). 


2.4  Discussion 

In  observation  equation  (1)  or  (4),  1  contains  measurements  (available);  V  is  white  noise 
associated  with  1  (can  be  determined  after  LSA).  Once  Xo  is  given,  L=F(Xo)  - 1  can  be 
calculated;  A  is  a  "design"  matrix  (to  be  calculated  before  LSA).  Finally,  AX  is  unknown 
(to  be  determined).  There  are  three  phases  of  LSA. 

Phase  0  (Planning):  Before  actual  measurements,  1  is  not  available  and  AX  cannot  be 
calculated.  However,  given  expected  quality  of  observations  1,  the  covariance  matrix, 
which  is  often  diagonal  under  the  assumption  that  observations  are  uncorrelated,  can  be 
derived.  The  weight  matrix  W  is  then  available,  given  unit  weight  variance  <3o.  The 
design  matrix  A  depends  on  the  configuration  of  the  overall  system  such  as  arrangement 
of  sensors,  sensor  types,  and  other  conditions.  Xo  is  needed  to  calculate  A,  but  not  AX 
and  1.  Consequently,  the  covariance  matrix  of  AX,  Sx,  can  be  computed  using  Equation 
(10).  Furthermore,  for  any  vector  variable  Z  that  is  a  function  of  X,  Z=BX,  its  covariance 
matrix  can  be  derived  by  an  error  propagation: 

Ez  =  BSxB\  (18) 

In  general,  the  task  in  the  plaiming  phase  is  to  investigate  the  expected  quality 
(covariance  matrix)  of  the  unknowns  and  other  derivatives  without  actual  observations.  If 
coordinates  (Xp,  Yp,  Zp)  of  an  object  P  to  be  determined  are  contained  in  X=(. .  .Xp,  Yp, 
Zp),  their  variances  and  covariances  are  contained  in  Ex.  The  error  ellipsoid  that 
represents  the  uncertainty  of  the  expected  location  is  characterized  by  its  three  axes. 

Efforts  should  be  made  to  analyze  the  quality  and  combinations  of  the  individual  error 
sources  of  observation  h  represented  in  the  weight  matrix  of  observations  and 
configuration  of  sensors  reflected  in  the  design  matrix  A.  An  estimation  of  the  object 
location  uncertainty  can  then  be  obtained.  Another  way  is  to  fix  the  uncertainty  of  the 
object  location  so  that  the  covariance  matrix  of  the  unknowns  is  fixed.  Inversely  the 
weight  matrix  or  design  matrix  can  be  calculated. 

Phase  1  (Data  Processing):  It  deals  with  evaluation  of  the  vector  AX  from  observations  1. 
Since  the  number  of  equations  is  usually  larger  than  the  number  unknowns,  a  least 
squares  solution  is  given  by  Equation  (9).  The  estimate  of  the  unknowns  is  then 
X=Xo+AX.  Note  that  the  high  order  terms  in  Equation  (2)  were  neglected.  This  requires 
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AX  to  be  determined  iteratively  with  the  updated  X  being  new  approximate  value  of  Xo. 
This  procedure  will  continue  until  AX  is  significantly  small. 

Phase  2  (Uncertainty  Analysis):  In  Equation  (10)  an  estimate  of  the  covariance  matrix  of 
X  is  given.  It  describes  the  variances  of  the  estimated  unknowns  and  covariances  between 
unknown  parameters.  Uncertainty  obtained  in  Phase  0  is  the  predicted  uncertainty. 
However,  uncertainty  estimated  in  Phase  2  is  the  one  associated  with  the  LS  solution  of  X 
in  Phase  1.  For  example,  error  ellipsoids  of  estimated  object  locations  can  be  visualized 
and  analyzed.  In  addition,  imcertainties  of  parameters  that  are  functions  of  the  unknowns 
can  also  be  calculated.  Statistic  tests  can  also  be  used  to  test  the  data  processing  result 
from  Phase  1  to  conclude  if  the  result  is  acceptable  or  not  (Mikhail  and  Grade). 


3.  Concept  of  an  Improved  Dynamic  Model 

Object  location  is  critical  to  a  wide  range  of  applications,  such  as  military  targeting, 
rescue  operations,  emergency  management,  and  environmental  monitoring.  Real-time 
operations  are  often  required.  Sensors  involved  in  such  a  real-time  system  (Figure  2)  are 
from  multiple  platforms  and  of  various  classes  such  as  SAR,  laser,  infrared,  electron- 
optic  array,  hyperspectral,  GPS,  INS  and  others.  To  meet  the  goal  of  accurate  object 
location,  a  good  understanding  of  the  characteristics  of  individual  sensors  and  their 
collective  impact  on  the  object  location  is  needed.  To  solve  the  problem  of  multiplatform 
and  multisensor  based  target  location,  we  should  address  a  series  of  issues  on  analysis  of 
object  location  uncertainty,  simulation  of  sensor-object  correlation,  uncertainty 
minimization,  and  optimization  of  platform  and  sensor  configuration. 

This  requires  an  irmovative  mathematical  model  for  characterizing  imcertainties  of 
individual  sensors  as  well  as  those  associated  with  sensor  and  target  locations  to  be 
estimated.  A  dynamic  model  for  target  location  and  uncertainty  propagation  will 
accurately  update  the  target  location  and  uncertainty  in  real-time  based  on  multiplatform 
and  multisensor  observations  collected  up  to  the  current  time.  If  the  location  uncertainty 
is  too  high,  the  system  will  suggest  additional  observations  using  the  available  sensors 
and  platforms  within  the  configuration.  An  extended  least  squares  estimator  will 
determine  the  final  target  location  using  all  observations  collected  to  give  the  highest 
accuracy  possible. 
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T2:  Optical  Sensor 


The  strategy  to  handle  the  comprehensive  multiplatform  and  multisensor  observations  is 
to  separate  direct  measurements  (e.g.  time)  of  sensors  from  their  resulting  geometric 
output  (e.g.  range),  so  that  existing  sophisticated  sensor  processing  software  will  deal 
with  sensor  internal  errors  to  compute  geometric  output  and  estimate  associated  errors. 
Algorithms  and  software  systems  for  processing  signals  within  a  sensor  are  usually 
provided  hy  manufactures  or  other  sources.  Geometric  observations  from  sensors  are,  for 
instance,  r2inge  and  range  rate  from  SAR,  range  from  laser,  image  coordinates  from 
airborne  or  satellite  optical/infrared  images,  locations,  angles  and  velocity  from  GPS  and 
INS.  A  covariance  matrix  of  the  observations  is  usually  simplified  to  a  diagonal  matrix, 
indicating  non-correlation  between  the  observations.  The  task  is  then  to  examine 
correlations  within  a  sensor  (for  example  range  and  range  rate),  within  a  platform,  and 
within  a  sensor  system.  Depending  on  the  research  results  the  observation  covariance 
matrix  may  or  may  not  be  diagonal.  This  will  certainly  affect  the  target  location  and 
uncertainty. 
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3.1  Target  uncertainty  in  an  integrated  sensor  system 

The  uncertainty  of  an  object  location,  represented  by  a  3-D  ellipsoid,  is  usually  large, 
based  on  a  reconnaissance  resource  (e.g.  satellite  in  Figure  2).  But  this  approximate 
location  provides  an  initial  location  for  platforms  with  higher  accurate  sensors  to  reduce 
the  location  uncertainty.  For  example,  given  GPS  and  INS  navigation  accuracies,  aircraft 
takes  radar  range  measurements  of  the  object  at  tk  and  tk+i  (may  not  be  “continuous”. 
Figure  3  (a)).  The  resulting  ellipsoid  of  the  object  is  relatively  flat  (almost  parallel  to  XY- 
plane),  namely  good  vertical  accuracy  and  relatively  poor  horizontal  accuracy.  If  a  FLIR 
image  either  from  the  same  platform  of  the  radar  or  another  platform  is  available  and 
used  in  determination  of  the  target  location,  the  corresponding  ellipsoid  will  become 
smaller  in  horizontal  plane  and  have  relatively  homogeneous  errors  in  all  three  directions 
(Figure  3(b)).  It  should  be  noted  that  the  size  and  orientation  of  the  object  location  error 
ellipsoid  are  ftmctions  of  a  number  of  factors  in  the  sensor  system  such  as  types  of 
measurements,  from  where  they  are  taken  with  respect  to  the  object,  and  combined 
impact  of  all  sensors.  Such  a  comprehensive  error  behavior  demands  the  development  of 
a  mathematical  model  to  allow  flexible  analysis  of  all  the  factors  involved  for  the  optimal 
design  of  sensor  configuration,  mission  planning,  and  accurate  object  location. 


Figure  3.  An  example  of  uncertainty  reduction,  (a)  ellipsoid  by  a  two-range 
triangulation,  (b)  adding  a  FLIR  measurement  to  reduce  the  uncertainty. 


3.2  Dynamic  model 

Suppose  that  we  start  with  a  rough  position  of  the  object  determined  by  a  reconnaissance 
resource.  The  estimated  object  location  can  be  improved  when  new  observations  become 
available.  Each  platform  is  defined  by  a  body  position  and  orientation  which  are 
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determined  by  an  INS  unit.  Offsets,  including  translation  and  rotation,  between  the  body 
and  other  sensors  are  usually  estimated  by  a  calibration  procedure.  Sensor  orientation  is 
derived  from  the  platform  orientation  and  calibration  parameters.  The  accumulative 
nature  of  the  pointing  errors  will  be  investigated. 

In  a  dynamic  model,  observations  are  those  from  sensors.  The  state  vector  includes  three 
coordinates  of  the  object,  three  coordinates  and  three  orientation  angles  of  each  platform 
(instead  of  each  sensor)  so  that  the  size  of  the  state  vector  is  effectively  reduced.  The 
model  estimates  the  state  vector  and  covariance  matrix  at  tk+i  based  on  their  values  at  tk. 
The  covariance  matrix  of  observations  is  given  according  to  a  priori  knowledge  of  the 
quality  of  each  sensor  class.  The  dynamic  behavior  of  the  state  (including  object  location) 
and  uncertainty  during  transition  will  be  studied.  A  dynamic  estimator,  such  as  Kalman 
filter,  will  be  used  to  minimize  the  object/sensor  location  uncertainty  and  to  estimate  the 
optimal  object  location. 

In  contrast,  a  static  model  will  treat  all  observations  as  static  observations,  no  matter  from 
which  platform,  which  sensor,  and  when  taken.  The  observations  along  with  a  covariance 
matrix  will  be  input  into  a  Least  Squares  (LS)  adjustment  system  with  a  very  large 
observation  vector  and  a  very  large  state  vector.  The  optimal  state  vector  is  estimated  by 
minimizing  the  sum  of  squared  noises  of  all  observations.  An  efficient  algorithm  will  be 
developed  to  simplify  the  LS  model  to  solve  the  3x1  target  state  vector  and  3x3  target 
location  covariance  submatrix.  The  static  model  supplies  the  optimal  target  location  and 
associated  covariance  matrix  once  all  observations  are  taken. 

Based  on  above  dynamic  and  static  models,  we  are  able  to  simulate  a  multiplatform  and 
multisensor  mission  for  an  optimal  platform  and  sensor  configuration  given  a  desirable 
object  uncertainty.  The  models  can  also  be  used  for  determining  the  accurate  object 
location  if  actual  measurements  are  tjiken. 


3.3  Equations 

In  this  djmamic  model,  the  state  vector  at  any  time  t  is  defined  as 

X(t)  =  (X,Y,Z, ...,  Xpi,Ypi,Zpi,Ypj,fpj,Zp],<yp,,^pi,K’^i,<ypi,^pi,/Cpj, ..., 

Xpn,Ypn,ZpnYp„,yp„,Zp„,<Wp„,^p„,/£'p„,<yp„,^p„,/irp^)  .  (19) 

Here  (X,Y,Z)  are  coordinates  of  the  object  to  be  located.  The  rest  of  the  vector  are 
position,  velocity,  orientation  and  angular  velocity  of  n  platforms  at  time  t.  Once 
calibrated,  position  and  orientation  parameters  of  sensors  on  each  platform  can  be  derived 
based  on  the  parameters  of  the  platform,  calibration  parameters  (positional  and  angular 
offsets),  and  gimbal  readings  if  any.  The  dimension  of  X(t)is  3+12n.  Observations  at  time 
t  are 
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Kt)  ~  (Xpush,0,XFLIR,yFLIR,rsAR,44fl  5  TlaSERj  ^laser  >  •  •  • » 

XpijYpijZpi  Kpj,d}pi,^pj,Kpi,...  )  .  (20) 

Here  the  observations  may  be  coordinates  of  the  object  in  a  push  broom  satellite  image 
and/or  a  FLIR  image,  a  range  and  range  rate  from  a  SAR  image,  a  laser  range,  GPS  and 
INS  measurements  of  position,  velocity,  orientation  angles,  and  angular  velocity  of  the 
ith  platform.  In  the  dynamic  model  Rk  is  the  covariance  matrix  of  1  at  kth  iteration.  Lo  is 
the  approximate  value  of  L.  Z=L-  Lo  is  usually  very  small,  but  not  zero.  The  dynamic 
states  of  the  platforms  can  be  estimated  by  a  Kalman  filter  with  the  observation  equation 
at  time  tk: 


Zk-HkXk  +  Vk.  (21) 

The  covariance  of  Vk  is  E(VkVk^).  H  is  a  Jacobian  matrix  containing  partial  derivatives  of 
Z  with  respect  to  X.  Vk  is  white  noise  of  Zk.  The  dynamic  update  of  X  is  performed  by 

Xk+i  =  Ok  Xk  +  Wk,  (22) 


Where  Ok  is  state  transition  matrix  to  be  determined,  Wk  is  white  noise  of  Xk 
with  known  covariance  matrix  of  Pk.  Qk=E(Wk  Wk)  ^  will  be  developed. 

The  iterative  solution  of  the  problem  starts  with  given  prior  covariance  matrix  and 
the  approximate/prior  estimate  X~  of  Xk.  The  Kalman  gain  is  defined  as 

K,=  p-Hl{H,p-lH\+R,)-\  (23) 


The  state  vector  and  covariance  matrix  are  updated  by 
X,=  X;+K,{Z,-H,X;),and 

(24) 

p,=  a-K,H,)p;. 

To  repeat  the  process,  the  “project  ahead”  parameters  are 

^k+\ 

(25) 

^+1  ^i+Qk- 

Hk  is  usually  a  function  of  X  and  should  be  updated  along  with  Zk  each  time  when  a  new 
value  of  X  is  obtained.  From  Equation  (24)  it  is  obvious  that  diagonal  elements  of  KkHk 
must  be  smaller  than  1  to  make  a  variance  great  than  0.  Therefore,  in  design  stage 
alternative  configuration  of  platforms  and  sensors  can  be  made  to  produce  different 
matrix  Hk  and  Kalman  gain  so  that  the  object  location  error  or  the  trace  of  Pk  will  be 
minimized. 
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3.4  Object  recognition  with  georeference  information 

Object  recognition  is  a  process  to  find  an  object  from  a  set  of  sensed  data,  such  as  an 
optical  image  or  SAR  image.  Tremendous  efforts  have  been  made  by  scientists  to  solve 
this  problem.  The  progress  in  this  research  area  is  yet  modest  in  comparison  to  other  fast 
growing  areas  because  of  the  complexity  of  the  problem.  Target  recognition  can  be 
improved  by  using  georeference  information  in  the  following  ways: 

•  The  known  sensor  geometry  and  object/sensor  uncertainty  will  greatly  reduce  target 
searching  dimensions  and  ranges.  For  example,  a  two-dimensional  search  in  image 
domain  can  be  reduced  to  an  one-dimensional  search  along  an  epipolar  line  using 
epipolar  geometry  derived  from  known  camera  orientation  information; 

•  If  the  position  of  the  target  is  found  in  one  image,  its  position  in  other  images  (even 
from  different  sensors)  can  be  predicted  by  using  a  DTED  or  a  second  image.  This 
provides  a  very  strong  geometric  constraint  that  will  be  valuable  for  candidate 
validation  or  matching  between  images;  and 

•  Since  sensor  orientation  parameters  are  known,  a  great  deal  of  information  in  scene 
domain  can  be  used  to  assist  target  recognition  in  image  domain.  For  instance,  if  a  3- 
D  design  model  of  the  object  is  available,  through  a  transformation  from  scene  to 
image  domain,  a  comparison  between  the  design  model  and  the  target  candidate  in 
images  can  be  performed.  Once  the  object  is  recognized  in  all  images,  its  accurate 
location  in  the  scene  domain  can  then  be  determined  using  the  above  static  LS  model. 


4.  Design  of  an  Experiment 

To  verify  some  of  the  above  concepts,  an  experiment  with  actual  data  will  be  carried  out. 
The  data  set  includes  one  scene  of  aerial  photograph  taken  by  a  mapping  camera  and  a 
SAR  image  taken  by  a  spotlight  radar  system.  Both  images  cover  an  area  with  objects 
that  have  known  coordinates.  The  photograph  was  scanned  to  produce  a  digital  image  of 
4,000  rows  and  9,000  columns.  Camera  focal  length,  principal  point  location,  and  lens 
distortion  parameters  are  given.  The  SAR  image  has  a  dimension  of  1800x1800  pixels.  It 
covers  a  part  of  the  aerial  photograph.  Pixel  resolution,  scene  center,  and  other 
parameters  are  available.  This  experiment  is  a)  to  calculate  horizontal  coordinates  of 
known  objects  based  on  one  SAR  data  under  the  assumption  that  the  terrain  is  flat  in  the 
area,  b)  to  use  both  the  aerial  photograph  and  SAR  image  for  object  location,  and  c)  to 
find  either  another  aerial  photographs  or  a  DTED  of  the  area,  plus  the  aerial  photograph 
and  SAR  image,  for  precision  object  location. 
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4.1  Spotlight  SAR  for  horizontal  coordinates  calculation 

From  pixel  to  image  coordinates 

An  objects  is  measured  by  its  coordinates  of  (row,column)=(i  j)  in  pixel.  The  coordinates 
are  then  converted  to  image  coordinates  (x,y): 

x  =  -(i-900)dx  and  y  = -(j-900)  dy.  (26) 

(ij)“(900,900)  is  the  location  of  the  scene  center  in  the  image,  dx  and  dy  are  pixel  size  in 
row  and  column  direction  respectively. 

Correction  from  the  display  plane  to  horizontal  plane 

There  is  an  angle  Po  between  the  display  plane  and  the  horizontal  plane.  Image 
coordinates  (x,y)  on  the  display  plane  is  transformed  to  those  (x’,y’)  on  the  horizontal 
plane  by 

x’  =x 

y’  =  y  cosPo.  (27) 

Transformation  to  the  Processed  Scene  System 

If  the  ground  squint  angle  is  0,  the  angle  that  rotates  the  image  plane  to  the  PCS  is 


Gac^lSO^-G. 


The  coordinates  in  PCS  are  calculated  as 
Xp  =  x’  cosGac  +  y’  sinGac 

Yp  =  -x’  sinGac  +  y’  cosGac  •  (28) 

Suppose  the  terrain  in  the  area  is  flat,  then  (Xp,Yp)  are  coordinates  of  the  object  in  PCS. 

If  a  terrain  model  is  available,  corrections  on  coordinates  can  be  computed  according  to 
the  elevation  difference  between  the  point  and  the  ground  plane  of  the  image  -  elevation 
of  the  scene  center. 

Transformation  from  PCS  to  ECEF 

Transformation  PCS  to  Earth  Center  Earth  Fixed  (ECEF)  coordinate  system  consists  of  a 
rotation  and  a  translation.  The  rotation  is  defined  by  a  rotation  matrix  that  is  a 
function  of  longitude,  latitude,  and  heading  of  the  planed  flight.  The  amount  of 
translation  is  determined  by  coordinates  of  the  scene  center  (Xpsc,  Ypsc,  Zpsc)  in  ECEF: 
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4.2  Aerial  photograph  and  SAR  for  object  location 

The  objective  is  to  compute  the  vector  r  at  point  P(X,Y,Z)  by  measuring  its  image  point 
p(x,y),  assuming  that  the  camera  exposure  center  location  and  orientation  are  known. 


Z 


Figure  4.  Aerial  photograph  for  object  location 


The  observation  equation  is 


rop=f-rp,^„e=‘^R' 
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Here  (x,y)  are  measured  image  coordinates  of  the  object.  2?^“^  is  a  rotation  matrix  from 
the  earth  coordinate  system  to  the  sensor  coordinate  system  and  A,  is  a  scaling  parameter. 
Dividing  the  first  two  equations  by  the  third  one  in  Equation  (30)  results  in  collinearity 
equations: 
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(31) 


Additional  parameters  such  as  lens  distortion  can  be  added  to  the  above  equation.  For 
each  object  location  we  will  have  two  equations  for  each  aerial  photograph  and  two 
equations  for  the  SAR.  Thus,  we  can  solve  for  three  unknown  coordinates  of  the  object 
(X,Y,Z). 


4.3  Precision  object  location 

If  more  observations  such  as  image  coordinates  of  the  object  in  another  aerial 
photograph,  a  DTED,  or  another  SAR  image  are  available,  the  object  location  can  be 
determined  with  more  redundant  observations  so  that  the  reliability  and  accuracy  of  the 
location  can  be  enhanced.  Statistic  tests  will  be  used  to  check  the  object  location  estimate. 
Its  final  error  ellipsoid  is  derived  from  the  covariance  matrix  of  the  unknown  vector  of 
LS  adjustment. 


5.  Conclusions 

Based  on  the  above  research  results,  the  following  conclusion  can  be  drawn: 

•  High  accurate  and  reliable  object  location  in  a  multiplatform  and  multisensor  system 
requires  precise  modeling  of  individual  sensor  behavior,  geometric  configuration,  and 
combined  impact  of  the  integrated  system; 

•  Considerations  should  be  given  to  the  entire  procedure  of  simulation  and  prior 
uncertainty  estimation,  optimal  object  location  estimation,  and  post  analysis; 

•  A  dynamic  model  for  the  multiplatform  and  multisensor  system  should  be  developed 
and  implemented  to  meet  demand  from  a  number  of  real-time  applications;  and 

•  Georeferencing  can  enhance  capabilities  of  object  recognition  and  other  AI 
applications. 
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SENSOR  FUSION  WITH  PASSIVE  MILLIMITER  WAVE  (PMMW) 

AND  LASER  RADAR  (LADAR)  FOR  TARGET  DETECTION 
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Abstract 

Advanced  sensors  and  guidance  techniques  are  required  in  killing  mobile  offensive  and  defensive 
systems.  Many  different  sensors  such  as  radar,  video  camera,  laser  radar  (LADAR),  millimeter  wave 
(MMW)  systems,  infrared  images,  acoustic  sensors,  etc.  are  available  for  such  usage.  However,  no 
single  sensor  seems  to  provide  completely  satisfactory  capabilities.  While  some  sensors  have 
complimentary  capabihties,  integration  of  multiple  sensors  for  kill  can  relax  the  task  difficulty  and 
provide  more  reliable  results.  The  use  of  multiple  sensors  can  also  reduce  the  possibility  of  being 
defeated  by  countermeasures.  In  this  project,  we  studied  the  framework  and  investigated  potential 
techniques  for  integration  and  fusion  of  information  from  passive  millimeter  wave  (PMMW)  and  laser 
radar  (LADAR)  systems.  The  focus  has  been  on  target  detection.  The  PMMW  is  used  to  detect  metal 
objects  and  the  LADAR  checks  those  regions  of  interest  for  other  evidence  of  existence  of  a  target. 
Advantages  obtained  by  integrating  these  two  sensors  include  reduction  of  task  complexity  and 
improvement  of  reliability,  both  due  to  efficient  localization  of  regions  of  interests  from  the  PMMW. 
Since  PMMW  possess  weather  penetration  capabilities  through  fog,  cloud,  smoke,  etc.,  the  combined 
system  has  a  near  all  weather  capability.  A  LADAR  provides  three-dimensional  (3-D)  information  for 
checking  target  details.  It  should  be  used  as  the  primary  sensor  for  target  selection  upon  target  detection. 
The  framework  of  the  fusion  is  based  on  the  Dempster-Shafer  decision  method.  The  fusion  may  be  done 
in  the  algorithm  level  and  sensor  level.  With  the  Dempster-Shafer  method  as  the  framework,  new  sensors 
or  new  decision  components  can  be  integrated  into  the  existent  system  easily. 
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SENSOR  FUSION  WITH  PASSIVE  MILLIMITER  WAVE  (PMMW) 
AND  LASER  RADAR  (LADAR)  FOR  TARGET  DETECTION 


Chun-Shin  Lin 


I.  Introduction 

Advanced  sensors  and  guidance  techniques  are  required  in  killing  mobile  offensive  and  defensive 
systems.  Possible  sensors  include  radar,  video  camera,  laser  radar  (LADAR),  millimeter  wave  (MMW) 
systems,  infrared  images,  acoustic  sensors,  etc.  However,  no  single  sensor  seems  to  provide  completely 
satisfactory  capabilities.  For  instance,  the  laser  range  sensor  and  video  camera  are  susceptible  to  adverse 
weather  (such  as  the  situations  with  fog,  rain  or  cloud)  as  well  as  smoke  and  dust  screen.  On  the  other 
hand,  radar  and  millimeter  wave  signals  can  penetrate  fog,  smoke,  etc.  but  cannot  provide  as  good  spatial 
resolution  as  that  from  optical  sensors.  While  some  sensors  have  complementary  capabilities,  integration 
of  multiple  sensors  for  kill  can  relax  the  task  difficulty  and  provide  more  reliable  results.  The  use  of 
multiple  sensors  also  reduces  the  possibility  of  being  defeated  by  countermeasures. 

The  objective  of  this  project  is  to  investigate  frameworks  and  potential  techniques  for  integration 
and  fusion  of  information  from  passive  MMW  (PMMW)  [1][2]  and  LADAR  systems  [3][4].  Although 
the  study  focuses  only  on  the  two  sensors,  further  frision  with  other  sensors  can  be  implemented. 

In  this  report,  the  PMMW  and  LADAR  are  first  briefly  introduced  in  Section  2.  The  suggested 
frsunework  is  given  in  Section  3.  More  details  on  target  detection  through  fusion  are  presented  in  Section 
4.  Section  5  gives  experimental  results.  Data  fusion  and  the  use  of  the  Dempster-Shafer  method  are 
discussed  in  Section  6.  Discussion  and  conclusions  are  in  Section  7. 

II.  PMMW  and  LADAR 

2.1  Passive  Millimeter  Wave  (PMMWI 

Millimeter  wave  frequency  bands  span  from  30  to  300  GHz.  Due  to  atmospheric  attenuation,  the 
primary  windows  vwth  minimal  loss  occur  at  35,  94,  140  and  220GHz.  Millimeter  wave  sensors  at  these 
frequencies  provide  a  near  all-weather  capability.  They  represent  a  compromise  between  microwave 
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systems  and  the  electro-optical  systems.  The  MMW  systems  do  not  have  the  all-weather  capabilities  of 
microwave  systems,  but  provide  a  better  resolution.  Compared  to  eletro-optical  systems,  MMW  systems 
cannot  give  high  resolution  but  are  superior  in  situations  with  fog,  cloud,  smoke,  dust  screen,  etc. 

A  passive  millimeter  wave  system  receives  the  signal  reflected  and  emitted  by  observed  objects 
and  does  not  require  an  MMW  source.  This  reduces  the  cost  and  system  complexity,  and  also  reduces  the 
possibility  of  being  detected  in  a  missile  attack  mission. 

Antenna  temperature  is  one  means  for  describing  the  amplitude  of  the  sensed  signal.  The  sources 
that  determine  the  antenna  temperature  include  the  apparent  brightness  temperature  of  the  sky,  natural 
terrain  materials  and  metal  objects.  The  apparent  brightness  temperature  of  a  source  contains  both  the 
emitted  and  reflected  energy  from  the  source  surface.  Different  materials  have  different  reflection  and 
emitting  properties.  One  useful  property  for  target  detection  is  that  metal  hardly  emits  but  does  well 
reflect.  Figure  1  shows  PMMW  images  and  visible  light  photos  of  a  scene  under  good  and  poor  weather 
conditions.  It  is  seen  that  the  fog  does  not  affect  the  PMMW.  For  more  a  detailed  introduction  and 
discussion  on  PMMW,  please  refer  to  reference  1 . 

2.2  Laser  Radar  (LADAR) 

Laser  radar  uses  laser  light  for  measuring  distance.  It  can  provide  both  a  range  image  and  a 
reflectance  image  and  both  can  be  used  in  missile  guidance.  The  range  image  provides  3-D  information 
and  has  high  angle  resolution.  The  sensor  is  especially  good  for  target  classification/recognition  but  is 
susceptible  to  adverse  weather  and  cannot  image  through  thick  cloud,  fog,  etc.  Due  to  its  narrower  field- 
of-view  angle,  it  is  also  less  adequate  for  searching  targets  in  a  broad  area.  The  contrast  in  a  range  image 
is  usually  not  large  but  the  3-D  information  is  there. 
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Figure  1.  Visible  light  photos  and  PMMW  images  (from  TRW) 


2.3  Advantages  from  Integration  of  PMMW  and  LADAR 

PMMW  and  LADAR  have  complementary  capabilities  and  fiision  with  these  two  sensors  will 
provide  the  following  advantages: 

•  Make  the  detection  task  easier  and  the  results  more  reliable;  reduce  the  search  time. 

The  PMMW  has  good  capability  to  distinguish  metal  from  other  materials  and  the  LADAR  can 
easily  detect  objects  above  the  ground  level  from  3-D  information.  Combining  information  from 
these  two  sensors,  the  target  detection  task  becomes  easier.  In  the  situation  with  clutter  in 
background,  with  only  a  video  image  or  a  range  image,  target  detection  is  extremely  difficult  and 
requires  a  lot  of  complicated  work  [5]  [6].  However,  with  the  use  of  both  PMMW  and  LADAR, 
the  task  should  become  much  easier.  In  addition,  the  result  should  be  much  more  reliable. 


•  LADAR  provides  3-D  information,  which  is  the  most  complete  one  for  checking  target  details. 
LADAR  can  be  used  as  the  major  sensor  during  the  further  step  in  target  selection. 

•  The  combination  will  create  the  near-all-weather  capabilities. 

•  PMMW  could  provide  some  size  information  of  targets  in  a  shorter  distance.  Such  information  may 
become  useful  in  case  LADAR  foils  due  to  weather  or  other  problems. 

ni.  Framework  for  Integration/Fusion  with  PMMW  and  LADAR 

There  are  two  types  of  sensor  integration  -  the  competitive  sensor  integration  and  the 
complementary  sensor  integration.  The  former  uses  sensors  returning  essentially  the  same  information. 
One  example  is  the  use  of  identical  sensors  to  measure  a  pressure.  The  readings  can  be  slightly  different 
due  to  either  noise  or  sensor  inaccuracy.  One  can  obtain  a  more  reliable  value  through  fusion  among 
them.  Complementary  sensor  integration  uses  sensors  returning  partial  information  on  the  physical  entity 
sensed.  The  use  of  PMMW  and  LADAR  belongs  to  this  type. 

In  this  development,  the  following  capabilities/features  are  desired; 

•  Integration  should  best  utilize  the  information  from  each  sensor,  extract  the  most  useful 
features/evidence  for  decision  making  (either  target  detection  or  selection). 

•  The  system  should  degrade  only  slightly  when  one  sensor  system  fails  due  to  adverse  weather, 
countermeasures,  or  other  causes. 

•  New  sensors  or  new  classifier  components  should  be  able  to  be  integrated  into  the  existent  system 
easily.  This  means  that  the  design  should  be  in  modules  and  the  fosion  should  give  soft  decisions 
(i.e.,  possibility  for  each  proposition  instead  of  only  “yes’  or  “no”). 

Figure  2  shows  a  general  diagram  of  sensor  fusion  with  capability  of  adding  sensors  or  algorithm 
components  (for  evidence  detection).  The  diagram  shows  two  levels  of  fosion  —  the  algorithm/evidence 
level  and  the  sensor  level.  The  evidence/algorithm  level  permits  ease  of  software  modification.  For 
example,  one  can  combine  the  results  from  two  types  of  classifiers  and  add  additional  classifier 
components  without  requiring  major  change  to  previously  designed  software.  The  sensor  level  makes  it 
possible  to  fuse  information  with  additional  sensors,  either  internal  or  external. 
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Figure  2.  A  general  structure  for  sensor  fusion 


rv.  Evidences  for  Target  Detection 

4.1  Evidences  from  PMMW  and  LADAR 

In  the  millimeter-wave  spectral  region,  metal  objects  have  extremely  low  emissivity  values  and, 
hence  are  very  reflective  of  millimeter  wave  energy.  Metal  objects  mainly  reflect  the  very  small  amount 
of  energy  coming  from  the  sky,  which  can  be  thought  of  as  radiometrically  very  cold.  In  contrast,  the 
soil,  trees  and  most  ground  objects  emit  MMW  and  appear  relatively  hot.  Consequently,  the  cold 
temperature  detected  in  a  PMMW  image  is  an  evidence  of  a  possible  metal  target. 

LADAR  range  images  on  the  other  hand  provide  3-D  information.  Objects  above  the  ground, 
structured  edges/surfaces,  surfeces  vertical  to  the  ground,  specific  dimensions,  etc.,  indicate  the  existence 
of  possible  targets. 


4.2  Detection  of  Evidences 


Three  types  of  evidences  were  used  in  the  illustration  of  the  experimental  results  described  below. 
These  were  cold  temperature  regions  in  PMMW  images,  and  jump  edges  and  vertical  surfaces  in  LADAR 
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images.  The  cold  areas  in  a  PMMW  image  provide  the  regions  of  interest  (ROI)  for  the  LADAR  system. 
The  LADAR  system  checks  each  of  these  cold  regions  for  the  jump  edge  and  vertical  surface  evidences. 
The  detection  of  cold  temperature  employs  the  histogram  information  extracted  from  the  PMMW  image. 
A  cutoff  is  selected  near  the  location  at  which  the  histogram  curve  starts  to  increase  quickly. 

Jump  edges  are  detected  along  each  column  of  pixels.  The  range  difference  R(tow,  column)- 
R(row+l,  column)  is  used  to  see  if  there  is  a  jump  edge  (R  denotes  the  range  value).  For  a  tall  object,  the 
range  jump  will  be  large.  If  the  heights  of  the  targets  to  be  detected  are  known,  the  range  differences  for 
possible  jump  edges  can  be  estimated.  This  can  be  used  to  eliminate  objects  that  are  either  too  low  or  too 
high  from  target  detection  consideration.  By  placing  limits  on  the  range  difference,  jump  edges  of  a 
building  can  be  removed. 

Another  evidence  used  in  the  experiments  is  the  existence  of  vertical  surfaces.  The  absolute  value 
of  the  range  difference,  |/?(row,  column)-/?(row+l,  column)|,  is  used.  A  small  range  difference  is  a 
criterion  used  in  detecting  vertical  surfaces. 

4.3  Diagram  for  Target  Detection 

For  target  detection,  the  PMMW  is  used  to  detect  metal  objects  and  the  LADAR  looks  into  the 
details  in  those  areas  for  verification.  The  diagram  with  three  evidences  mentioned  above  is  shown  in 
Figure  3  to  illustrate  the  detection  task.  Each  evidence  is  detected/evaluated  using  a  program  module. 
The  results  are  integrated  through  the  fusion  process. 
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other  possible 
local  decisions 


Figure  3.  Diagram  for  target  detection 


V.  Experimental  Results 

For  experiments,  five  PMMW  and  five  laser  range  images  were  generated  employing  the  IRMA 
multi-spectral  target  and  background  scene  simulation.  All  images  were  for  the  same  scene  at  different 
distances.  The  depression  angle  was  20  degrees  and  the  distances  were  1000m,  800m,  600m,  400m  and 
200m.  These  images  are  shown  in  Figures  4  and  5. 

In  a  PMMW  image,  low  energy  areas  can  be  obtained  by  simply  selecting  an  appropriate  cutoff 
value  according  to  the  image  histogram.  This  value  should  be  located  near  the  point  where  the  histogram 
curve  starts  to  sharply  incline.  One  typical  histogram  curve  is  shown  in  Figure  6.  When  the  distance  is 
longer,  the  image  becomes  less  clear  and  the  cutoff  point  should  be  selected  at  the  higher  end.  Figure 
7(al),  (bl),  ...  (el)  show  the  results.  It  is  seen  that  in  Figure  7(el)  some  blobs  do  not  belong  to  targets 
and  some  targets  are  missing.  This  is  due  to  the  stronger  effects  of  blurring  because  of  the  relatively 
smaller  target  image  at  the  longer  distance.  Validation  should  begin  with  the  right-size  blobs  if 
computational  time  is  critical. 
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(a)  The  scene  with  12  targets  (b)  PMMW  image  at  1000m 


(c)  PMMW  imaee  at  800m 


(e)  PMMW  image  at  400m 


Figure  4.  The  scene  and  the  PMMW  images 


20-10 


(a)  LADAR  image  (1000m) 


(b)  LADAR  image  (800m) 


(c)  LADAR  image  (600m) 


(d)LADAR  image  (400m) 


Figures.  LADAR  images  at 

(a)  1000m 

(b)  800m 

(c)  600m 

(d)  400m  and 

(e)  200m 


(e)  LADAR  image  (200m) 


Figure  6.  Typical  histogram  curve  for  the  observed  scene 
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(al)  PMMW  after  thresholding  (200m) 


(a2)  Jump  edges  from  LADAR  (200m) 


(b2)  Jump  edges  from  LADAR  (400m) 


(b3)  Vertical  surfaces  from  LADAR  (400m) 


(cl)  PMMW  after  thresholding  (600m) 


(c2)  Jump  edges  from  LADAR  (600m) 


(d2)  Jump  edges  from  LADAR  (800m) 


(d3)  Vertical  surfaces  from  LADAR  (800m) 


(e2)  Jump  edges  from  LADAR  (1000m) 


(e3)  Vertical  surfaces  from  LADAR  (1000m) 


Figure  7.  Evidences  of  targets  after  image  processing  (at  200,400,600,800  and  1000m) 
(al)(bl)(cl)(dl)(el)(e2)  Cold  regions  from  PMMW  images 
(a2)(b2Xc2)(d2)(e2)  Jump  edges  from  LADAR  images 
(a3)(b3)(c3)(d3)(e3)  Vertical  surfaces  from  LADAR  images 
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VI.  Data  Fusion 


Fusion  is  necessary  when  separate  decision  modules  make  decisions.  Majority  vote,  the  AND-OR 
technique,  Baysian  inference,  neural  network  approach  and  the  Dempster-Shafer  method  are  some  typical 
ways  for  data  fusion  [7].  Among  these,  the  first  two  give  hard  decisions  and  the  others  provide  soft 
decisions.  A  soft  decision  is  one  that  assigns  a  possibility  to  each  proposition. 

In  the  following,  the  Dempster-Shafer  method  [8]  [9]  is  introduced.  Assume  that  there  are  two 
decision  modules  i  andy,  and  A,  X,  and  Y  are  propositions.  The  possibility  mass  m,{X)  is  assigned  for  X 
by  /  and  mj{Y)  is  assigned  for  Fbyy.  The  Dempster-Shafer  method  will  combine  the  information  from  i 
andy  and  calculate  the  possibility  mass  ffi, ;,(/!)  for  A  as 

Z  m^iXyn  (Y)  Z  m,(JO/w,(f) 

...(^)=£Qizi - ^ 

l-ZKp  1-  Z  mi(X)m(Y) 

where  is  the  sum  for  the  conflicting  ones  (i.e.,  A?^7=^) .  Some  examples  for  propositions  are  {tank}, 
(tank  u  truck},  {buildingwtruck},  etc.  Intersection  of  two  sets  usually  gives  a  smaller  set  or  even  an 
empty  set.  For  instance,  {tank}o{tank  truck}  =  {tank}  and  {tank}n{building>^ruck}= 

The  method  can  be  used  to  combine  the  decisions  from  individual  sensors  to  determine  the 
overall  possibility  of  an  interesting  spot  being  or  not  being  a  particular  target.  For  example,  since  the 
metal-roof  building  blob  in  our  experimental  PMMW  images  is  large,  the  PMMW  system  would  assign  a 
small  value,  say  0. 1,  to  the  possibility  of  it  being  the  desired  target,  i.e.,  »2, (target)  =  0.1.  However,  since 
the  area  is  a  metal,  the  system  may  not  want  to  exclude  the  possibility  of  it  being  the  target.  Thus  it  may 
assign  /«i(nontarget)  =  0.2.  Consequently,  the  unassigned  value  0.7  (=1-0. 1-0.2)  will  go  to  /«/(©),  where 
©  denotes  either  target  or  nontarget  (i.e.,  uncertain).  On  the  other  hand,  the  LADAR  may  find  that  the 
object  (the  building)  is  too  high  and  has  good  confidence  to  assign  a  big  value  for  the  possibility  of  it 
being  a  nontarget.  The  possibility  values  may  be  /W2(target)  =  0.1,  /M2(nontarget)  =  0.8  and  mA@)  =  0.1. 
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The  Dempster-Shafer  method  can  then  combine  the  information  from  these  two  sensor  systems.  The 
results  are  shown  in  Table  1 . 

Table  1.  Calculation  based  on  Dempster-Shafer  combination 


/«2(nontarget)=0 . 8 

m2(©)=0.1 

AMj2(target)=0.01/0.9 

Kl=0.08 

OTy2(target)=0.0 1/0.9 

/«y(nontarget)=0.2 

K2=0.02 

m;2(nontarget)=0. 16/0.9 

»ii2(nontarget)=0.02/0.9 

mi2(nontarget)=0.56/0.9 

/m/2(©)=0.07/0.9 

All  entries  in  the  table  are  calculated  using  equation  (1).  K1  and  K2  are  for  empty  sets  (conflicting 
propositions)  and  are  not  valid  possibility  mass.  In  this  example,  the  value  of  (l-2Xp)  is  0.9.  The  total 
possibility  mass  for  each  of  /M/^Ctarget),  /w/2(nontarget)  and  /m/2(0)  is  the  sum  of  those  corresponding  ones 
in  the  above  table,  i.e., 

»2i2(target)  =  0.01/0.9+0.07/0.9+0.01/0.9  =  0. 1 
/Ky2(nontarget)=  0.16/0.9+0.56/0.9+0.02/0.9  =  0.74/0.9 
/M/2(©)=0.07/0.9 

Note  that  the  sum  of  the  above  three,  /w;2(target)+/M;2(nontarget)+  remains  equal  to  1  due  to  the 

normalization  through  the  division  of  (1-SAp).  The  method  to  assign  nii’s  must  be  designed.  One 
possible  approach  is  to  use  fuzzy  rules. 

The  Dempster’s  rules  of  combination  are  both  communicative  and  associative  [10].  The  order  of 
combination  is  not  important  to  the  final  results. 

VII.  Discussion  and  Conclusions 
7.1  Possible  Extension  on  the  Study 

Further  extension  can  be  in  two  aspects  —  a  thorough  study  on  detail  target  detection,  and  the  study 
on  target  selection. 
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Target  Detection 


In  the  short  ten-week  period,  an  algorithmic  framework  was  developed  and  employed  to  illustrate 
the  conceptual  advantages  of  integrating  PMMW  and  LADAR  for  target  detection.  Further  study  should 
be  directed  at  the  following: 

•  Possibility  value  assignment 

The  Dempster-Shafer  method  was  suggested  for  data  fusion.  This  method  requires  the 
assignment  of  a  possibility  value  for  each  proposition.  One  possible  approach  for  generating  the 
possibility  mass  at  the  individual  sensor  level  is  to  use  fuzzy  rules.  The  design  of  this  portion  will  be 
critical  to  the  overall  system  performance. 

•  Refinement  based  on  current  accomplishment 

The  following  are  example  problems  that  can  be  investigated  further. 

Extending  the  detection  distance 

The  detection  beyond  1000m  becomes  harder  by  using  the  simple  thresholding  technique.  A 
more  sophisticated  technique  may  be  needed  to  increase  the  detection  distance. 

-  Optimizing  parameters  used  in  detection 

One  example  is  the  selection  of  cutoff  for  detecting  “cold”  areas  in  PMMW  images. 

Using  additional  evidences 

Only  three  evidences  were  used  in  the  study.  Questions  to  be  answered  include  whether  more 
evidences  will  help  and  what  is  the  “best”  set  of  evidences,  and  to  what  degree  does  selection 
of  a  particular  set  of  evidences  affect  performance  and  computational  loads. 

•  Exploring  more  complicated  environments 

Testing  schemes  in  more  complicated  environments  should  be  performed.  Through  such  study,  it 
is  possible  to  better  understand  the  performance  of  a  design  and  also  discover  potential  problems 
that  must  be  solved. 

Tarset  Selection 

This  10-week  study  focused  on  target  detection.  After  targets  are  detected,  one  of  them  will  be 
selected  for  attack.  The  missile  system  will  need  to  know  more  details  about  targets  if  it  wants  to  do 
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selection.  For  instance,  the  missile  may  prefer  to  attack  a  TEL  or  tank  than  some  other  type  of  vehicle. 
Since  LADAR  has  high  resolution,  it  is  more  adequate  for  this  type  of  decision  making.  Additionally,  the 
LADAR  can  provide  a  range  image  as  well  as  a  reflectance  image.  Fusion  using  these  two  types  of 
images  is  a  research  problem  deserving  further  investigation.  The  fusion  could  provide  some  advantages 
yet  imknown.  At  shorter  distance,  the  PMMW  may  be  able  to  acquire  some  degree  of  size  information. 
Such  information  could  become  useful  in  case  LADAR  fails  due  to  weather  conditions  or  other  causes. 

7.2  Conclusions 

In  this  study,  fusion  of  PMMW  and  LADAR  data  was  performed.  The  focus  was  on  target 
detection.  PMMW  is  first  used  to  detect  metal  objects  and  then  LADAR  checks  those  regions  of  interest 
for  other  evidences  of  existence  of  a  target.  Advantages  achieved  by  integrating  these  two  sensors 
include  the  reduction  of  task  complexity  and  the  improvement  of  reliability,  both  due  primarily  to  the 
efficient  localization  of  regions  of  interests  employing  the  PMMW.  Since  PMMW  can  penetrate  fog, 
cloud,  smoke,  etc.,  the  combined  system  has  a  near  all-weather  capability.  The  fi'amework  of  the  fusion  is 
based  on  the  Dempster-Shafer  decision  method.  The  fusion  may  be  done  at  the  algorithm  level  and  at  the 
sensor  level.  Determination  of  possibility  mass  for  each  proposition  for  each  sensor  is  a  required  work, 
and  represents  a  major  topic  for  suggested  study  extension.  Other  issues  on  further  extension  were 
addressed  in  the  previous  subsection. 
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Boundary  Conditions 
for 

Direct  Numerical  Simulation  of  Turbulent  Flow 


Chaoqun  Liu 
Professor 

College  of  Engineering  and  Science 
Louisiana  Tech  University 


Abstract 


The  computational  results  of  FDL3D  flow  solver  using  the  high-order  compact  scheme  and 
high-order  filter  for  some  fundamental  flows  are  reported  for  code  verification.  The  boundary 
conditions  in  a  curvilinear  coordinates  for  inflow,  outflow,  far-field,  and  solid-wall  surface  is 
derived  and  discussed  with  some  computational  results. 
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1  Introduction 


Most  important  flows,  such  as  transition,  turbulence,  vortex  dynamics,  etc.  which  Air  Force 
is  interested  in,  are  involved  in  a  time-dependent  process  that  requires  a  time-accurate  numer¬ 
ical  prediction,  such  as  direct  numerical  simulation  (DNS)  or  large  eddy  simulation  (LFS). 

As  well-understood  now,  DNS  and  LES  are  required  to  have  minimum  errors  in  wave  mag¬ 
nitude  and  phase.  However,  low  order  up-winding,  hybrid,  or  other  schemes  can  introduce 
serious  dissipation  and  dispersion  even  for  a  single  wave  and  are  therefore  not  encouraged 
to  use  for  DNS  or  LES.  Although  second-order  LES  calculation  is  still  popular  in  literature 
nowadays,  but  is  fundamentally  incorrect. 

As  pointed  by  Hirsch  (1997),  a  historic  mistake  was  made  (Fromm,  1964)  and  was 
quoted  by  many  text-books  (Roache,  1972)  that  stated  central  differencing  is  not  stable 
for  convection-diffusion  equations  unless  the  Peclet  number  is  less  than  2.  This  wrong  state¬ 
ment  prevented  CFD  community  from  using  high-order  central  differencing  for  decades.  In 
fact,  the  Peclet  number  has  nothing  to  do  with  the  stability  and  many  central  differencing 
schemes  are  stable  when  the  Peclet  number  is  greater  than  2,  but  may  get  some  non-physical 
oscillating  which  is  caused  by  some  unresolved  high-frequency  waves.  However,  these  high- 
frequency  waves  can  be  restricted  or  filtered  by  using  proper  filters.  The  difference  between 
artificial  damping  and  filter  is  that  the  former  adds  some  artificial  viscosity  to  the  Navier- 
Stokes  equations  or  changes  the  governing  Navier-Stokes  equations,  but  the  later  only  change 
the  solution  by  filtering  some  unresolved  waves  which  is  only  related  to  the  solution  and  has 
no  changes  to  the  Navier-Stokes  equations.  High-order  central  differencing  scheme  with  filter 
is  much  more  reasonable  approach  than  up-winding  or  central  differencing  tied  with  artificial 
damping  approaches. 

The  high-order  compact  central  differencing  approach  has  appeared  for  years  (Strikwerda, 
1989),  but  Lele  (1992)  first  systematically  used  it  with  filter  for  CFD.  Recently  Visbal  and 
Gaitonde  (1997,  1998)  gave  general  approach  to  develop  both  high-order  compact  scheme 
and  high-order  filter  (up  to  10th  order).  If  this  approach  is  verified  and  validated,  it  could 
be  a  new  direction  to  conduct  CFD  calculation  for  time-dependent  flow  and  may  replace 
many  old  approaches  such  as  up- winding,  TVD,  central  with  damping,  hybrid,  etc.  The 
verification  and  validation  is  one  of  the  purposes  of  my  work  for  the  summer. 

Actually,  my  group  used  the  high-order  scheme  with  special  multi-dimensional  filter  (  we 
call  them  fine-coarse-fine  grid  mapping)  to  successfully  simulate  the  whole  process  of  flow 
transition  on  flat  plate  5  years  ago  (Liu  et  al,  1993,  1995)  and  then  for  flow  transition  around 
airfoils  with  all  speeds  recently  (Liu  et  al,  1997,  1998).  After  5  years,  it  is  still  hard  to  find 
anyone  else  who  can  repeat  our  work. 

DNS /LES  for  complex  geometry  was  criticized,  but  now  an  exponentially  increasing  num¬ 
ber  of  researchers  are  working  on  this  direction.  The  fundamental  problem  with  DNS/LES 
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seems  to  be  cleared  and  our  experience  shows  DNS  can  be  used  for  flow  transition  and  LES 
can  be  used  for  fully  developed  turbulent  flow  for  a  spectrum  of  engineering  applications. 

The  other  purpose  of  this  work  is  to  develop  appropriate  boundary  conditions  for  time- 
dependent  Navier-Stokes  equations  which  is  compatible  and  can  maintain  high-order  accu¬ 
racy  for  the  solution.  The  characteristic  boundary  conditions  for  Euler  system  can  be  found 
in  a  number  of  literatures  (Whitfield,  1983;  Thompson,  1987,  1990).  For  Navier-Stokes  sys¬ 
tem,  Poinsot  and  Lele  (1992)  have  tested  some  approaches  similar  to  characteristic  boundary 
conditions,  but  for  Cartesian  coordinates  only.  For  time-dependent  flow  with  perturbation, 
many  authors  suggested  to  use  sponge  boundary  condition  (Fasel,  1990;  Kloker  et  al,  1993; 
Collis  et  al,  1996).  We  should  develop  characteristic-like  boundary  conditions  for  curvilinear 
coordinates  with  sponge  boundary. 


2  Boundary  conditions  with  compatibility 

The  characteristic  boundary  conditions  for  Euler  system  cannot  be  directly  used  for  Navier- 
Stokes  equations.  Actually,  there  is  no  characteristic  relation  in  Navier-Stokes  system.  How¬ 
ever,  in  some  certain  circumstances,  such  as  free  stream,  far  field,  etc.  where  the  viscosity 
is  less  important,  the  characteristic  relation  can  still  be  used.  Poinsot  and  Lele  (1992)  gave 
some  ideas  which  use  the  original  N-S  equation  but  replace  some  convection  terms  by  char¬ 
acteristic  relation.  However,  they  only  gave  the  formula  for  Cartesian  coordinates.  Here,  we 
derive  these  relations  in  curvilinear  coordinates. 


2.1  Governing  Equations 


The  three-dimensional  Navier-Stokes  equations  in  generalized  curvilinear  coordinates  (^,  t),  C) 
can  be  written  as 


IdQ  d{E-E,)  d{F-F,)  d{G-G„) 

J  dt  ^  drj  d(i 

The  flux  vectors  for  compressible  flow  are 


Q 


E 


(  ^  \ 

pu 

pv 

pw 

\  «  / 

(  \ 

J  pUu  +  p^x 

-  pUV+piy 

pUw-\-pi2 

V  u{e  +  p)  / 


(1) 
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pV 

pVu  +  prjj; 
pVv  +  prjy 
pVw  +pT]z 

V{e  +  p) 
pW 

pWu  +  p(x 

PWV+  pCy 
pWw  +p(z 
W{e  +  p) 


where 


j  Txx^x  "I"  '^yx^y  "I"  Tzx^z 
~j  "^xyCx  "t"  "^yy^y  "h  '^zy^z 
"^xz^x  “I"  '^yz^y  "I"  Trz^z 
^  Qx^x  +  +  9z^z 

/  0 

2  '^xx'Hx  "h  '^yx^y  “I"  Tzaj^z 
—  Txy^x  "I"  "^yyVy  "I"  '^zy'Hz 
TxzPx  "I"  '^yz^y  ”l"  '^zz'Hz 
\  (Ixflx  "H  9y^y  "h  9z^z 
/  0 

'^xxCx  "I"  ’^yxCy  "J"  “^zxCz 
~j  '^xyCx  "t"  ^yyCy  “f"  '^zyCz 
TizCr  "I"  ’^yzCy  '^zzCz 
^  QxCx  "i"  9yCy  "h  QzCz 


U  =  U^X  +  v(y  +  W^z 
V  =  UPz  +  VPy  +  WPz 
W  =  u(x  +  vCy  +  wCz 

J  is  the  coordinate  transformation  Jacobian  matrix: 

d{x,y,z) 


=  — ^  +  \p{u^  +  +  w"^) 

^  —  1  Z 


2/i  du  ,  du  .  du.  ,.dv  dv  dv.  ,c  ^ 

~  dC  ^  dC 

2m  r^/>  .  dv.  ,.du  du  du  dw  dw 

~  '^'^^dp'^  ^*'5C  ^  ^^dp  d^  dp^ 

2m  .  dto  .  dw.  dv  ,  dv  dv  te^u  d«  d« 
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(5) 


H  du  du  /.  .  dv  dv  dv.^ 

Txy  =  +  +  +  +  + 

fi  j  du  du  du  (t-dw  ^ 

fi  ..  dw  dw  dw.  ,  dv  dv  dv  ^ 

ryz  =  +  +  +  +  + 

n  dT  dT  ^  dT^ 

q,  _  +  VT,y  +  «;t„  +  dC  ^ 

_  dT  dT  ,  ar, 

qy  -  UT,y  +  VTyy  +  WTy,  +  Q^  +  dC  ^ 

n  _  dT  dT  ,  dT^ 

qz  =  UT,.  +  .TV.  +  «;t,.  +  +  ^.-  +  C^) 

All  the  equations  are  cast  in  non-dimensional  forms,  and  the  four  dimensionless  parameters 
resulted  from  non-dimensionalization  are  defined  as: 


Mr  = 


^fW: 

p:u:l‘ 


Pr  = 


7  = 


The  sup-star  '*'  represents  reference  values.  R,  Cp  and  Cv  are  the  ideal  gas  constant,  the 
specific  heats  at  constant  pressure  and  constant  volume.  Sutherland’s  law  in  dimensionless 
form  is, 

_^|1±5  110.3 

^  T  +  S'  ^  T* 


2.2  Compatible  boundary  conditions 


Using  the  characteristic  analysis  [15],  in  which  hyperbolic  terms  in  ^  direction  of  Eq.  (1)  are 
modified,  we  can  recast  this  system  as: 

dp  ,  ^.dp  ,  du  du  du.  du  du  .  du. 

^ +  i,.|E + =  0 

dt  dq  P  dq  dC  p  d( 

dv  ,  ..^dv  1  dp  rrrdv  ^  ^  dp  ,  . 

^  +  ‘^3  +  U^  +  -7„^  +  l^^  +  -C.^  +  ms3  =  0  (12) 

dw  ,  ,rdw  1  dp  „rdw  1  ^  dp 

+  d4  +  V — I — qz-T — |-  W +  t;*S4  =  0 
dt  dq  p  dq  d^  p  dC 

dp  .  ,rdp  ,  du  du  du.  ,,  du  ^  du  ^  du. 
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where  vector  d  is  obtained  from  characteristic  analysis,  and  can  be  expressed  as 

(  di\  (  ^[2(^1  +  ^5)  +  A]  '' 

^2  -  Cl)  -  ^{iyCz  +  6 A) 

dz  =  ^{C,-Ci)  +  ^Ul  +  ^l)Cz-UyC.]  (13) 

d-A  ~  ~  iy)^A 

\dz  /  y  \{Ci  +  £5)  / 

where  c  is  sound  speed,  and 

p = \/e+«2+ff  (!■*) 

£<s  are  the  amplitudes  of  characteristic  waves  corresponding  to  each  characteristic  velocity 
Aj,  which  are 

Ai  =  (15) 

A2  =  A3  =  A4  =  £/  (16) 

As  =  C/  -  Q  (17) 

Ci  is  defined  as  Eq.  (??)  and  £-s  can  be  expressed  as 

^  xr  du  .  dv  _  ^  dw.  ,  dp. 

Cl  =  {U-C^)[ 


^dp  dp 


r  TTl  2^^  ''t'\ 

Cl  -  U(c 

£3  =  +  (18) 

A  =  C'(-6|+6|) 

.  ^  ,rPc,^  du  ^  dv  .  dw,  .  dp, 

£5  =  +  +  +  + 

The  terms  uts-s  in  Eq.  (12)  represent  viscous  terms,  they  are  given  by  transformation  of 
initial  viscous  terms  in  Eq.(l) 


where  P  is  matrix  as 


m33  =-•'^(-37-+ + 


+  v^  +  w^)  (1-7)“  (1-7)^^  (1-7)^"  7-1 
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In  other  two  directions,  analysis  should  be  the  same.  In  ^  direction,  the  modified  N-S 
equation  should  be 

do  dp  ,  du  du  du.  ^rdo  du  ^  du  .  du. 

^  +  ,.3^)  +  +  ms,  =  0 

du  ,  ^.du  1  dp  ^jdu  \ ^  dp 

^  +  <(,  +  =  0 

dv  ,  ^.dv  1  dp  j^dv  1  dp  „  /«,x 

dt  dr]  p  df]  d^  p  d^ 

9p  ,  „rdp  ,  du  du  du.  dp  t  1  •  n 

^ + yHvs-^ + ’'*9^  +  + «•  af  +  «' ae> + =  " 


+  ■^s)  +  ^2] 

2^(£5  -  Cl)  +  ^[(Cy  +  Q)^Z  -  CxCyA] 

^(£5  -  Cl)  -  ^[{UyCz  -  {Q  +  C)£4] 
^(£5  -  ■Cl)  -  ^[Cx£3  +  CyA] 

|(£l  +  £5) 


^  xr  pc,^ 

(H"  -  cd[-ji.<s-^  +  ^»ac  +  ^‘ac  >  +  ac' 

'  a<  a<' 

H'C-cl+c.l) 


=  v'CJ 


Q  =  c/? 


In  r]  direction,  modified  N-S  equation  should  be 

^  +  di  +  +  p(Cx^  +  Cx^  +  Cx^)  +  U-^  +  p(^x^  +  ^x;^  +  ^X^)  +  msi  =  0 

du  ,  „rdu  I  .  dp  ^^du  dp  .  „ 

;^  +  d2  +  H^^  +  -Cx^  +  t/;^  +  -^x^  +  ms2-0 

dv  ,  ,„rdv  1  .  ,rdv  dp  .  „ 

^  +  d3  +  I^^  +  -Cy^  +  f/^  +  -^y^  +  «l«3  =  0  (26) 
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dw  ,  „,dw  ,  1  -  1  dp  .  _ 

—  +  (i4  +  VT  +  VtS4 


dp 

di 


+  ^  +  H't  +  rP«.S  +  oS?  +  O  +  t'i  +  +  f-S + + -^  =  » 


du 


du 


du 


dp 


du 


du 


du. 


dC 


"  ^^dC'  d^ 


'd^  '  ^^dC 


(  di\ 

d2 

dz 

= 

d^ 

K  <^5  > 

Cl 

== 

C2 

= 

^3 

= 

£4 

= 

^5 

=: 

f  ^[|(A  +  A)  +  A]  \ 

^(£5  -  Cl)  +  +  ‘nDC^  -  r}xr}zCA] 

2^  (£5  —  ^1)  —  ^[tIxCz  +  rizC^ 

^^{C^  —  Cl)  —  -^[VxilzCs  —  ivl  +  ^v)A] 

^  \(Ci  +  A) 

.  ^  pc,  du  dv  dw.  .  dp, 

(V  -  ^(’>*3^ + ’'»a^  + 

vic-‘^  -  % 

^  dr)  dr] 

dv  du 

5t;  dw 

(V  +  C,)(  +  V.^)  +  5^1 


(27) 


(28) 


P  =  \l’7l+’l^+’l. 

C,  =  cff 


(29) 

(30) 


2.2.1  Subsonic  Inflow  Boundary  Condition 

For  subsonic  inflow  at  ^  =  1,  four  quantities  should  be  imposed.  We  choose  it,  u,  ta,  T  as  these 
quantities,  but  density  p  is  obtained  by  Eq.  (12)  .  At  this  boundary  four  characteristic  waves 
£2,  £3,  £4,  £5,  are  entering  the  computational  domain,  while  £1  is  going  outward.  Then  £1 
can  be  calculated  from  interior  points  using  Eq.  (18)  by  a  third  order  finite  difference  scheme. 
£2,  £3,  £4,  £5  are  given  by  [15]  [12] 


du  dv 

~  W  ” 

(31) 

^  du  z.  dw 

(32) 

(33) 

(34) 
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2.2.2  Subsonic  Outflow  Boundary  Condition 

For  subsonic  outflow  at  ^  =  Nx,  static  pressure  poo  of  far  field  is  given.  At  this  boundary, 
four  characteristic  waves  C21 C3,  £4,  £5,  are  leaving  the  computational  domain,  while  the  left 
one  £1  is  entering  the  field.  So  that  £2,  £3,  £4,  £5  can  be  calculated  from  interior  points 
using  Eq.  (18)  by  a  third  order  finite  difference  scheme,  while  the  other  one  is  given  by 


£1  =  K{p-poo) 


(35) 


2.2.3  Far  Field  Boundary  Condition 


For  fax  field  boundary  at  ^  =  Nz,  the  directions  of  characteristic  waves  are  determined 
automatically  by  local  values,  then  the  £,  of  outgoing  waves  are  calculated  from  interior 
points  using  Eq.  (23),  and  those  going  inward  waves  are  set  to  zero.  i.e. 


f  £,  forXi  >  0 
(  0  forXi  <  0 


(36) 


2.2.4  No-Slip  Wall  Boundary  Condition 


For  no-slip  wall  at  ^  =  1,  velocities  u,v,w  are  set  to  zero,  £3,  £4  are  given  by 

£3  =  -  -7.C.)|r  +  (f.<.  -  (.<.)%  (37) 

£4  =  +  (f.Cv  -  (38) 


The  characteristic  wave  £1  is  going  outward,  so  it  can  be  calculated  from  interior  points,while 
the  in-going  wave  £5  is  given  by 


£5  =  £1  +  ^(^(C.£3  +  (.C„l)  -  +  6||)l 


(39) 


for  adiabatic  wall,  £2  should  be 


for  isothermal  wall,  £2  should  be 


£2  =  0 


(40) 


£3  =  2(7  -  i)(£i  +  £5) 


(41) 


When  all  £,•  at  each  boundary  are  obtained,  d,-  can  be  easily  calculated  from  Eq.  (13), 
Eq.  (22),  Eq.  (27).  In  Eq.  (12),  Eq.  (21),  Eq.  (26),  apart  from  d,-,  all  other  terms  are 
computed  just  as  interior  points.  Boundary  Eq.  (12)  Eq.  (21),  Eq.  (26)  are  linearized  and 
solved  implicitly  with  interior  points. 
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2.3  Sponge  boundary  conditions 

Using  sponge  layer  actually  leads  to  an  exponential  decay  of  reflection.  The  idea  to  use 
sponge  boundary  condition  to  eliminate  wave  reflection  at  inflow,  outflow  and  fax-field  was 
initiated  by  Fasel  (1990).  The  basic  idea  can  be  described  as  follows: 

If  the  governing  equation  can  be  abbreviated  as 


dU 

dt 


+  Lt/  =  0 


In  the  outflow  sponge  layer  we  may  change  the  equation  to: 


(42) 

(43) 


^  +  LU-h{x)U'  =  (44) 

at 

(45) 

where  U'  is  a  perturbation  of  U  (//'  =  U  -  Uo).  This  will  no  doubt  add  very  large  damping  to 
the  perturbation  in  the  sponge  layer.  Here  we  can  use  a  formula  given  by  Collis  k  Lele  (1996): 


fd{x)  =  { 


0 


X  €  (xjjXo] 

otherwise 


(46) 

(47) 


Here  x,  denotes  the  start  of  the  sponge  and  xo  denotes  the  location  of  the  outflow  boundary. 
The  constants,  A,  and  control  the  amplitude  and  strength  of  the  sponge  function.  We 
expect  that  the  sponge  function  only  help  eliminate  the  non-physical  wave  reflection  from 
the  boundary  but  has  little  effect  to  physical  solution  outside  the  sponge.  Usually,  we  take 
one  T-S  wavelength  as  the  sponge  layer. 

The  sponge  layers  can  be  embedded  in  the  inflow  and  outflow  sections  to  eliminate  the 
reflection  in  the  streamwise  direction.  For  subsonic  flow,  the  inflow  sponge  is  necessary,  while 
for  supersonic  flow,  the  inflow  sponge  can  be  removed. 


3  Code  verification  with  different  filters  and  different 
boundary  conditions 

In  order  to  make  sure  the  FDL3D  code  can  give  converged  results  for  steady  flow  without 
purturbation,  a  number  of  simple  flows  was  selected  to  verify  the  code.  The  flow  and  bound¬ 
ary  conditions  and  convergence  are  described  as  follows: 
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Farfield 


Inflow 

<=:> 


Outflow 


Solid  Wall 


Figure  1:  Boundary  conditions 


Flow  test  conditions:  XM1=0.1,  RE=1000,  TW=1.0,  where  XMl,  RE,  and  TW  refer  to 
Mach  number,  Reynolds  number,  and  temperature  of  the  wall  respectively. 

1.  Periodic  Boundary  in  x-,  y-,  and  z-  directions 

Case  I:  uniform  flow  with  all  periodic  boundaries 
grids=32xl3xl3,  uniform  grids  in  y-direction 
dx=0.0967,  dy=0.0833,  dz=0.0833,  dt=0.01 
x=0-3,  y=0-l,  z=0-l 

max.  error=0.44935E-05  after  1000  iterations 

Conclusion:  The  compact  scheme  code  is  converged  for  streatched  grids 

Caase  II:  uniform  flow  with  stretched  grids 
grids=32xl3xl3,  stretched  grids  in  y-direction 
periodic  in  x,  y,  and  z 

dx=0.17,  dy=0.126  (j=l),  dz=0.315,  dt=0.02 
x=335.9  -  341.1,  y=0-22,  z=0-3.785 
max.  error=0.111e-5  after  1000  iterations 
Conclusion:  The  compact  scheme  code  is  converged 

Case  III:  uniform  flow  with  large  dx 

grids=32xl3xl3,  stretched  grids  in  y-direction 

periodic  in  x,  y,  and  z 

dx=1.7,  dy=0.126  (j=l),  dz=0.315,  dt=0.02 

x=3359  -  3411,  y=0-22,  z=0-3.785 

max.  error=0.1977e-l  after  1000  iterations 

Conclusion:  The  convergence  is  seriously  degenerated  when  dx  is  larger 
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Case  IV:  uniform  flow  with  small  dx 
grids=32xl3xl3,  stretched  grids  in  y-direction 
periodic  in  x,  y,  and  z 

dx=0.017,  dy=0.126  (j=l),  dz=0.315,  dt=0.02 
x=3359  -  3411,  y=0-22,  z=0-3.785 
max.  error=0.1347e-4  after  1000  iterations 
Conclusion:  The  convergence  is  worse  when  dx  is  smaller 

Grid  test  conclusion: 

The  code  is  convergent  for  periodic  boundaries,  but  is  sensitive  to  aspect  ratio  of  grids 


2.  Wall  Bounded  on  top  and  bottom,  periodic  in  x  and  z  directions  (Channel  flow): 

Case  I:  Eighth-order  filter  with  low  order  filter  at  boundary  points 
grids=32xl3xl3,  uniform  grids  in  y-direction 
periodic  in  x  and  z  but  wall  bounded  in  y 
dx=0.0967,  dy=0.0833  (j=l),  dz=0.0833,  dt=0.01 
x=0-3,  y=0-l,  z^O-1 

max.  error=0.67235E-02  after  1000  iterations 
It  is  not  converged. 

Case  II:  Same  as  Case  I,  but  the  filter  is  changed  to  second  order  in  y-direction 
max.  error=0.24073E-03  after  1000  iterations 
max.  error=0.30272E-05  after  2000  iterations 
It  is  converged  well. 

Case  III:  Same  as  Case  I,  but  change  the  filter  to  forth  order  in  J-direction  but  no 

filter  on  j=2  and  j=jdm-l 

max.  error=0.11050E-02  after  1000  iterations 

max.  error=0.63993Ei-03  after  2000  iterations  and  starts  oscillating  after  around  1500 
iterations. 

Case  IV:  Same  as  Case  I,  but  change  the  filter  to  forth  order  in  J-direction  but  second- 

order  filter  on  j=2  and  j=jdm-l 

max.  error=0.68211E-2  after  1000  iterations 

max.  error=0.46214E-02  after  2000  iterations 

Converges,  but  very  slow  with  some  oscillating 

Case  V:  Same  as  Case  I,  but  change  the  filter  to  eighth  order  in  J-direction  but  no 
filter  on  j=2,  4  and  j=jdm-l,  jdm-2,  jdm-3 
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max.  error=0.81015El-03  after  1000  iterations 
max.  error=0.41929E-03  after  2000  iterations 
max.  error=0.28442EI-03  after  3770iterations 
Converges  with  some  oscillating 

Conclusion:  The  current  filter  treatment,  high-order  for  interior  points  but  low-order 
for  boundary  points  does  not  work,  but  it  works  if  we  use  a  filter  with  same  order  (e.g. 
2).  It  gives  a  clue  that  we  have  to  develop  new  approach  for  filter  at  boundary  points. 
It  is  better  than  the  current  approach  if  we  do  not  use  filter  on  near  boundary  points. 

Suggestions:  Not  use  filter  on  near  boundary  points  or  develop  a  new  approach  for 
filter  on  boundary  points  which  is  compatible  and  will  not  generate  noises. 


3.  Wall  Bounded  on  bottom,  symmetric  on  top,  and  periodic  in  x  and  z  directions  (half 
channel  flow): 

Case  I  :  grids=32xl3xl3,  uniform  grids  in  y-direction 

periodic  in  x  and  z  but  wall  bounded  on  bottom  and  symmetric  on  top 

dx=0.0967,  dy=0.0833  (j=l),  dz=0.0833,  dt=0.01 

x=0-3,  y=0-l,  z=0-l 

Second-order  filter 

max.  error=  0.16020Ei-02  after  1000  iterations 
max.  error=:  0.47519E-03  after  2000  iterations 

Converges  well  with  some  oscillating  and  the  solution  sounds  reasonable  (half  channel 
solution). 


4.  Wall  Bounded  on  bottom,  symmetric  on  top,  periodic  in  z-  direction,  characteristic  at 
inflow,  and  extrapolation  at  outlet  (half  channel  flow). 

Case  I  : 

grids=32xl3xl3,  uniform  grids  in  y-direction 
dx=0.0967,  dy=0.0833  (j=l),  dz=0.0833,  dt=0.01 
x=0-3,  y=0-l,  z=0-l 

Characterisctic  at  inflow  (Whitfield’s  formula) 

Second-order  filter 

max.  error=  0.57943E1-01  after  1000  iterations 
max.  error=  0.24919E-01  after  2000  iterations 
It  does  not  converge. 
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Case  II  : 

grids=32xl3xl3,  uniform  grids  in  y-direction 
dx=0.0967,  dy=0.0833  (j=l),  dz=0.0833,  dt=0.01 
x=0-3,  y=0-l,  z=0-l 
Characterisctic  at  inflow  (Lele’s  formula) 

Second-order  filter 

max.  error=  0.16017E-02  after  1000  iterations 
max.  error=  0.45652E-03  after  2000  iterations 

Converges  slow  with  some  oscillating.  The  solution  has  no  negative  v  near  inflow. 

Conclusion:  Whitfield’s  formula  cannot  be  used  for  viscous  flow  near  the  wall.  Lele’s 
formula  for  N-S  equations  can  be  used  for  viscous  flow,  but  converges  very  slow.  We 
may  need  to  use  implicit  inflow  boundary  conditions. 
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Figure  2:  Channel  Flow 


4  Conclusion  and  suggestions 

We  conclude  the  report  as  follows  : 

1.  The  FDL3D  code  with  high-order  compact  scheme  and  high-order  filter  can  work  for 
both  steady  and  time-dependent  Navier-Stokes  equaations 

2.  The  code  (FDL3D)  needs  to  be  improved  for  high  aspect  ratio  grids 

3.  The  code  needs  new  filter  for  boundary  points 

4.  We  can  use  symmetric  boundary  condition  for  far-field  (similar  to  half  channel) 
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5.  We  can  use  extrapolation  for  outflow  boundary 

6.  The  current  wall  boundary  condition  is  fine 

7.  Lele’s  compatible  boundary  condition  with  characteristic  relation  can  be  used  for 
Navier-Stokes  equations  (viscous  flow),  but  may  need  to  be  implicit. 

8.  Use  of  sponge  can  improve  converges  which  is  more  important  for  time-dependent  flow 
to  eliminate  non-physical  reflection 

We  need  to  continue  this  effort  to  develop  appropriate  filters  and  compatible  boundary 
conditions  for  the  FDL3D  code. 
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BroiRECTIONAL  REFLECTANCE  DISTRTOUTION  FUNCTIONS 
DESCRIBING  FIRST-SURFACE  SCATTERING 


Carl  E.  Mungan 
Assistant  Professor 
Department  of  Physics 
The  University  of  West  Florida 


Abstract 


The  bidirectional  reflectance  distribution  function  function  (BRDF)  is  a  measure  of  the 
amount  of  light  scattered  by  some  medium  from  one  direction  into  another.  Integrating  it  over 
specified  incident  and  reflected  solid  angles  defines  the  reflectance,  which  can  be  easily  related 
to  the  absorptance  (or  emissivity)  of  a  sample.  The  BRDF  can  thus  be  taken  as  a  fundamental 
quantity  for  the  optical  characterization  of  an  object  and  it  correspondingly  is  important  in  a 
large  variety  of  applications.  When  a  beam  of  electromagnetic  radiation  (visible,  infrared,  radar, 
etc.)  strikes  a  body,  it  can  scatter  off  the  top  or  first  surface,  as  well  as  from  the  volume  or 
subsequent  interfaces.  However,  the  total  amount  of  light  reflected  from  the  first  surface  depends 
primarily  on  the  complex  index  of  refraction  of  the  illuminated  medium  (relative  to  that  of  the 
incident  medium)  and  is  often  sufficiently  large  that  this  dominates  the  scattering  from  the 
material.  On  the  other  hand,  the  topography  of  this  interface  determines  the  angular  distribution 
of  the  scattered  radiation — smooth  surfaces  reflect  almost  entirely  into  the  specular  direction, 
while  with  increasing  roughness  the  light  tends  to  diffract  into  all  possible  directions.  Ultimately 
an  object  will  appear  equally  bright  throughout  the  outgoing  hemisphere  if  its  surface  is  perfectly 
diffuse  (i.e.,  Lambertian).  Measuring  and  modeling  the  BRDF  can  thus  give  valuable  information 
about  the  nature  of  a  target  sample. 

The  paper  is  org^ized  as  follows.  The  introduction  lists  some  domains  of  study  in  which 
reflectance  plays  ^  important  role,  as  well  as  the  relationship  between  it  and  other  quantities  of 
importance  in  optics.  Next,  the  nomenclature  needed  to  define  and  characterize  the  BRDF  is 
presented,  along  with  some  related  issues.  Following  this,  a  long  section  reviews  the  principal 
analytical  and  numerical  models  used  to  describe  first-surface  scattering;  this  comprises  the  heart 
of  the  present  paper  and  considerable  effort  has  been  expended  to  unify  the  often  disparate 
notations  and  points  of  view  in  the  literature.  Finally,  this  report  ends  with  a  summary  of  select 
experimental  measurements,  most  of  which  are  quite  recent  and  varied  in  style  and  purpose;  this 
gives  a  flavor  for  the  ongoing  efforts  in  this  field. 
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BIDIRECTIONAL  REFLECTANCE  DISTRIBUTION  FUNCTIONS 
DESCRIBING  FIRST-SURFACE  SCATTERING 

Carl  E.  Mungan 


Introduction 

A  variety  of  models  and  measurements  of  reflectance  have  been  described  in  the  literature 
but  their  range  of  validity  is  generally  restricted  by  the  domain  of  interest  of  the  authors. 

Roughly  speaking,  these  domains  can  be  described  as  follows: 

Remote  sensing — aircraft  or  satellite  measurements  of  terrestial  vegetation  and  geography, 
usually  under  ambient  atmospheric  lighting  conditions; 

Astronomy — telescopic  measurements  of  planets  and  moons  in  the  solar  system  illuminated 
directly  or  indirectly  by  the  sun; 

Computer  graphics — development  of  techniques  intended  to  photorealistically  simulate  the 
surface  appearance  of  a  wide  variety  of  materials; 

Military/commercial — field  measurements  under  carefully  controlled  conditions  of  man¬ 
made  objects,  typically  painted  or  bare-metal  air  or  ground  based  targets; 

Optics — goniometric  laboratory  measurements  of  small,  well-characterized  samples. 

Select  models  and  experiments  applicable  to  each  of  these  five  domains  are  discussed  in  this 
paper. 

In  general,  theoretical  models  begin  by  considering  light  to  be  incident  on  a  spot  on  the 
surface  of  an  object  from  a  range  of  solid  angles  Q,  (which  can  vary  from  a  delta  function  for  a 
fiilly  directional  beam  to  2it  for  the  hemisphere)  centered  on  polar  (zenith)  angle  6^  (defined  with 
respect  to  the  macroscopic  surface  normal)  and  azimuthal  angle  (conveniently  defined  with 
respect  to  some  sample  feature  in  the  case  of  an  anisotropic  surface  or  arbitrarily  and  irrelevantly 
for  an  isotropic  one).  This  light  must  be  either  transmitted,  absorbed,  or  reflected  by  the  object; 
the  fraction  of  the  incident  flux  P  (radiant  power  in  W)  which  is  subsumed  by  each  of  these 
mechanisms  is  specified  by  the  dimensionless  ratios  called  the  transmittance  T,  the  absorptivity 
(or  absorptance)  a  (not  to  be  confused  with  the  absorption  coefficient  given  by  -ln[l-a]/L  where 
L  is  the  sample  length),  and  the  reflectance  p,  respectively.  I  will  always  assume  that  the 
illuminated  object  is  opaque,  so  that  T=  0;  accordingly,  a  +  p  =  1.  The  reflected  light  (which  can 
be  more  generally  described  as  scattered  light)  is  collected  by  a  detector  spanning  a  solid  angle 
centered  on  angles  6^  and  The  emissivity  eof  the  sample  is  defined  to  be  the  dimensionless 
ratio  of  the  total  radiant  flux  emitted  by  the  sample  to  that  of  a  blackbody  having  the  same 
geometry  and  temperature.  Kirchhoff  s  law  says  that  e=  a,  as  follows  by  imagining  the  sample 
to  be  enclosed  inside  an  opaque  cavity  whose  temperature  is  equal  to  that  of  the  sample:  this 
equality  means  that  there  can  be  no  net  gain  from  or  loss  to  the  blackbody  environment  by  the 
object.  Furthermore,  by  interposing  narrowband  filters  between  the  sample  and  cavity,  it  is 
evident  that  the  spectral  emissivity  e(A),  defined  in  terms  of  the  ratio  of  emitted  fluxes  in  a  unit 
wavelength  interval  centered  around  A,  must  equal  the  spectral  absorptivity  a(A),  whose 
definition  is  similarly  a  ratio  of  spectral  fluxes.  [Note  that  I  have  eschewed  the  notation  £x,  to 
avoid  confusion  with  the  spectral  emission  coefficient  defined  in  Sec.  13.1.1  of  Hecht  (1998).] 

An  object  is  said  to  be  a  graybody  if  £(A)  is  a  constant  less  than  1.  Nicodemus  (1965, 1970)  has 
argued  that  equality  also  holds  for  the  directional  quantities,  f(0,^)  =  oc(d,4>),  for  emission  into 
some  direction  zuid  absorption  from  the  same  direction.  [One  could  then  cdl  e  the  hemispherical 
emissivity,  equal  to  the  average  of  the  directional  emissivity  over  all  projected  solid  angles, 
analogous  to  Eq.  (2)  below.]  Thus  the  directional  and  spectral  dependence  of  the  emitted  light 
can  be  related  to  the  reflectance  and  temperature  (via  the  Stefan-Boltzmann  law)  of  the  sample. 
Furthermore  the  polarization  of  the  emission  is  seen  to  be  the  complement  of  that  of  the  reflected 
radiation  at  the  same  wavelength,  a  result  which  can  also  be  understood  by  visualizing  an 
emitting  center  as  lying  slightly  below  the  surface  of  the  sample,  so  that  the  light  escapes  in 
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accordance  with  the  Fresnel  relations  [cf.  Eq.  (6)  below];  in  fact,  Sandus  (1965)  has  argued  that 
this  holds  even  for  emitting  centers  lying  on  the  surface.  Consequently,  I  will  mainly  focus 
attention  on  the  reflectance  for  the  remainder  of  this  paper,  even  in  the  infrared  region  where  the 
sample  emission  is  significant  at  the  ambient  temperatures  of  interest. 

Reflectance  Nomenclature 

The  incident  flux  P  per  unit  illuminated  area  A  of  an  object  is  known  as  the  irradiance  (or 
illuminance)  H  =  dPIdA  in  units  of  W/m^  the  corresponding  emitted  or  scattered  quantity  is 
called  the  emittance  or  exitance.  [Note  that  the  element  dA  of  area  is  taken  to  be  small  on  the 
macroscopic  scales  of  interest,  but  large  compared  to  the  sub-resolution  irregularities  in  the 
sample  surface,  and  is  taken  to  be  perpendicular  to  the  macroscopic  surface  normal  (Nicodemus, 
1965).]  For  simplicity,  both  the  illuminance  and  the  exitance  will  be  referred  to  as  the  irradiance 
of  the  surface,  to  be  distinguished  from  the  intensity  of  the  irradiating  beam,  I  =  dP/(dA  cosff), 
where  the  cos0  term  projects  the  element  of  surface  area  dA  into  the  direction  of  propagation  of 
the  beam,  which  is  inclined  at  polar  angle  6  to  the  (macroscopic)  surface  normal.  The  irradiance 
per  unit  projected  solid  angle  is  called  the  radiance  (or  luminance,  which  corresponds  to  the 
photometric  concept  of  brightness),  L  =  dHI{cos6dQ)  =  dl/dQ.  in  units  of  W  m'^  sf ‘.  Here  the 
element  of  solid  angle  is  dQ  =  sin0  dO  d(p',  sometimes  (Nicodemus  et  al. ,  1977)  a  separate 
symbol  is  also  introduced  for  the  element  of  projected  solid  angle  cos  6  dQ,  though  I  will  not  do 
so,  to  keep  things  simpler.  Let  the  subscript  i  refer  to  incident  quantities  and  r  to  reflected 
(scattered)  terms.  The  bidirectional  reflectance  distribution  function  (BRDF)  can  now  be  defined 
as/(0i,^,;(9r,^,.)  =  dLJdHj  =  dPJidPj  cosO,  dQ,)  with  units  of  sr‘;  here  JP,  is  the  incident  power 
illuminating  dA  from  direction  (P„0,)  and  dP^  is  the  radiant  power  reflected  from  dA  into  the 
outgoing  solid  angle  dQ.,  centered  about  the  direction  (0r^0r)-  Usually  the  surface  is  taken  to  be 
(macroscopically)  planar  and  the  incident  beam  to  have  a  spatially  uniform  intensity  profile,  so 
that  one  can  integrate  over  a  macroscopic  area  A  of  the  sample  to  get/=  JP/(P,  cosP,.  d€l,)  where 
now  P,  is  the  incident  power  illuminating  A  and  dP,  is  the  flux  reflected  from  A  into  dQ.,.  For  an 
isotropic  surface, /is  a  function  only  of  (j)^  -  and  not  of  the  two  azimuthal  angles  individually. 
Unlike  reflectances,  which  cannot  exceed  unity,  the  BRDF  can  be  very  large,  becoming 
unbounded  for  a  purely  specular  reflector — cf.  Eq.  (5)  below.  When  necessary,  a  double 
subscript  can  be  added  to /to  denote  the  polarizations  of  first  the  source  and  second  the  detector 
relative  to  the  planes  of  incidence  and  reflection,  respectively.  In  that  case,  these  four  values  off 
can  be  thought  of  as  the  elements  of  a  2x2  matrix  f  and  the  component  intensities  of  the  incident 
and  reflected  radiation  as  two- vectors  such  that  dL,  =  f  <iH,.  However,  this  description  does  not 
suffice  to  determine  the  scattering  for  other  states  of  incident  and  reflected  polarization,  such  as 
circular,  because  the  phase  information  of  the  fields  has  not  been  retained.  One  approach  to 
circumventing  this  limitation  is  to  work  instead  with  the  (complex)  components  of  the  incident 
and  reflected  electric  fields,  E,  and  E,.,  respectively,  which  are  cormected  by  a  2x2  scattering 
matrix  S  (i.e.,  E,.  =  S  E,),  which  is  known  as  the  Jones  calculus  (Ruck  et  al.,  1970).  Alternatively, 
the  Mueller  matrix  and  Stokes  vector  representation  can  be  used,  wherein  f  is  expressed  as  a  4x4 
matrix  and  the  beams  as  intensity  4-vectors  (Flynn  and  Alexander,  1995).  This  latter  approach  is 
needed  for  describing  p^ially  polarized  radiation,  while  the  Jones  notation  is  used  for  coherent 
illumination  and  reflection.  If  the  radiation  is  partially  coherent,  then  a  treatment  in  terms  of  the 
coherency  (or  polarization)  matrix  is  necessary.  For  a  nice  discussion  of  the  relation  between  the 
Jones,  Mueller,  and  coherency  matrices,  with  particular  reference  to  the  ensemble  averaging 
necessa^  for  handling  scattering  from  statistical  media,  see  Kim  et  al.  (1987). 

By  integrating  the  BRDF  over  all  scattered  angles,  one  gets  the  dimensionless  directional- 
hemispherical  reflectance 

p(e,,0,;2;r)=  J/(0.,^.;0^,^^)cose,JQ,,  (1) 

2k 

which  gives  the  fractional  amount  of  flux  reflected  into  the  entire  hemisphere  out  of  that  incident 
from  a  particular  direction.  (The  integral  over  2k\s  of  course  a  shorthand  meaning  integrate  d 
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from  0  to  Tdl  and  (j)  from  0  to  Ik,  for  a  total  solid  angle  of  2k.)  If  instead  one  averages  over  all 
incident  angles,  the  hemispherical-directional  reflectance  is  obtained  as 

dpGn-,e,A) = I  f(B, 4, ;e^,t)cose, da, .  (2) 

^  2iz 


Note  that  the  denominator  is  K  and  not  2k  because  of  the  cos0,  projection  factor.  It  should  be 
clear  from  inspection  of  these  two  equations  how  to  write  other  quantities  of  interest,  such  as  the 
conical-hemispherical  reflectance,  the  bihemispherical  reflectance  (or  albedo),  and  so 
on — Nicodemus  et  al.  (1977)  tabulate  a  variety  of  such  entities.  If  the  prefactor  before  the 
integral  in  Eq.  (2)  is  left  off,  the  resulting  quantity  is  instead  called  the  hemispherical-directional 
reflectance  factor,  which  is  defined  as  the  ratio  of  the  flux  reflected  by  the  sample  to  that  which 
would  be  reflected  by  an  ideal  Lambertian  reflector  for  the  same  geometry,  because/=  l/;r  for 
the  latter  according  to  the  discussion  following  Eq.  (9). 

From  the  definition  of  the  BRDF,  the  scattered  radiance  can  be  related  to  the  incident 
radiance  by 


me^,<t>,)  =  J  L,(0„<^,)/(0„^,;0„^,)cos0,da,., 


(3) 


which  can  be  rather  complicated  to  evaluate  analytically.  The  emissivity  is  given  by  Kirchhoff  s 
law  as 


e(0, .  ^, )  =  1  -  J  /(0, .  <t>i ;  )  cos  (4) 

It: 

and  is  thus  simply  related  to  the  (directional-hemispherical)  reflectance,  as  discussed  in  the 
introduction. 

According  to  the  Helmholtz  reciprocity  theorem, so  that  either 
direction  may  be  that  of  the  incident  beam  with  the  other  the  reflected  beam.  Note  from  Eqs.  (1) 
and  (2)  that  this  would  imply  that  the  directional-hemispherical  reflectance  equals  the 
hemispherical-directional  reflectance  factor;  interestingly  enough,  however,  equality  of  these  two 
reflectance  terms  remains  true  even  when  Helmholtz  reciprocity  does  not  (Nicodemus,  1970). 
There  has  been  considerable  discussion  about  the  conditions  under  which  reciprocity  holds 
(Clarke  and  Parry,  1985;  Kriebel,  1996;  Shirley  et  al,  1997;  Snyder,  1998)— for  example,  it 
clearly  does  not  apply  to  a  Faraday  isolator.  It  is  generally  accepted  however  that  the  theorem  is 
obeyed  by  most  materials  (including  compound  objects)  under  ordinary  conditions  and  hence 
models  which  are  not  reciprocal  are  generally  considered  unphysical  and  experimental  failures 
are  often  attributed  to  measurement  errors  or  limitations. 


BRDF  Models  of  Surface  Scattering 

It  is  often  possible  to  distinguish  surface  scattering  from  volume  scattering.  Some  samples 
(e.g.,  bare  metals)  have  a  penetration  depth  (equal  to  die  reciprocal  of  the  absorption  coefficient) 
which  is  so  small  that  all  of  the  reflection  can  be  assumed  to  occur  at  its  illuminated  or  first 
surface.  Other  samples  (e.g.,  a  painted  object)  exhibit  scattering  both  from  the  first  surface  (and 
possibly  subsequent  surf^aces,  such  as  the  paint-substrate  interface)  as  well  as  from  the  bulk 
interior  (e.g.,  due  to  defects  or  paint  pigments).  Still  other  samples  (e.g.,  a  forest  imaged  from  an 
airplane)  do  not  have  a  well-defined  first  surface  at  all  and  could  perhaps  be  best  described  as 
pure  volume  scatterers  (Snyder  and  Wan,  1998).  Volumetric  reflection  almost  invariably 
involves  multiple  scattering  (although  a  sufficiently  cratered  surface  can  as  well).  This  greatly 
adds  to  the  complexity  of  the  problem  and  is  not  fully  understood.  For  this  reason,  I  will  restrict 
attention  to  first-surface  scattering. 

Begin  by  considering  the  simplest  case.  A  purely  specular  reflector  (i.e.,  a  perfectly  smooth 
and  planar  mirror)  has  a  BRDF  equal  to  zero  unless  0^  =  (j),.  =  +  K,  in  which  case  the 

reflectance  is  described  by  the  Fresnel  equations,  so  that  (Ellis,  1994) 
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/=m) 


(5) 


cos  0,  sin  0^ 

assuming  that  the  illuminated  area  of  the  mirror  is  large  enough  (compared  to  the  wavelength  of 
the  light)  that  diffraction  from  it  into  off-specular  directions  is  negligible.  For  nonmagnetic 
materials  (Hecht,  1998), 


and 


with 


with 


cos  0,  -Jn^-  sin^  0,. 

r  = - -■■■ 

COS0,.  +  -yln^  - sin^  0,. 


^  COS0,  -  sin^  0. 

’’  cos 0.  -t-  ^jn^  -sin^Oj 


(6a) 

(6b) 


for  polarization  perpendicular  (“senkrecht,”  also  called  horizontal  or  TE  and  denoted  by  an  h, 
or  ±)  and  parallel  (sometimes  referred  to  as  vertical  or  TM  and  symbolized  by  v,  +,  or  II)  to  the 
plane  of  incidence,  respectively.  Here  n  =  n/n,  where  and  n,  are  the  (complex)  refractive 
indices  of  the  incident  medium  and  the  transmitted  medium  (i.e.,  the  top  layer  of  the  sample), 
respectively;  n,  =  1  and  n,  =  n  correspond  to  the  usual  case  of  light  in  space  scattering  off  a 
medium  of  index  n.  Any  incident  beam  can  of  course  be  decomposed  into  s  and  p  components, 
provided  that  proper  accounting  is  made  of  their  relative  phase.  Recall  that  these  Fresnel 
reflectances  are  strongly  dependent  upon  the  angle  of  incidence — for  example,  Rp  falls  to  zero  at 
the  Brewster  angle  for  a  dielectric  (or  has  a  minimum  at  the  pseudo-Brewster  angle  for  a 
conducting  medium)  but  rises  to  unity  for  grazing  incidence.  Substitution  of  Eq.  (5)  into  Eq.  (1) 
identifies  /?(0,)  as  the  directional-hemispherical  reflectance;  if  the  mirror  is  ideal  (i.e.,  has  infinite 
conductivity)  then  this  reflectance  is  unity  for  any  polarization  and  wavelength  (Bom  and  Wolf, 
1965).  Note  that  a  flat  specular  reflector  does  not  depolarize  incident  s  or  p  radiation,  although  it 
will  reverse  the  handedness  of  circular  light  and  rotate  the  plane  of  polarization  of  light  linearly 
polarized  in  directions  other  than  s  and  p,  as  discussed  in  more  detail  later. 

All  real  surfaces  have  some  roughness  however.  If  the  characteristic  size  scales  (to  be 
defined  more  precisely  below)  of  this  roughness  are  large  compared  to  the  wavelength  of  the 
light,  then  the  approximation  of  geometrical  optics  holds.  In  that  case,  the  simplest  model  for  the 
surface  scattering  is  that  of  Torrance  and  Sparrow  (1967),  who  pictured  the  surface  as  being 
comprised  of  small,  randomly  oriented,  specular  facets.  Neglecting  shadowing  and  masking  of 
the  facets  by  each  other,  the  BRDF  for  single  scattering  from  this  isotropic  surface  is  easily 
shown  to  be 


f  =  R(P)- 


m) 


(7) 


4  cos  0;  cos  0,. 

where  2j3  is  the  angle  between  the  incident  and  reflected  directions,  known  as  the  bistatic  angle 
(2/3  =  0  corresponds  to  monostatic  or  back  scattering,  while  2/3  =  ;r  refers  to  forward  scattering 
pertinent  to  the  optical  extinction  theorem  for  example),  and  H(0„)  is  the  density  function  (in 
units  of  sf ‘)  of  facet  normals  pointing  in  the  direction  0„  relative  to  the  macroscopic  surface 
normal,  i.e.,  5(0„)  cICi„  dA  is  die  total  (not  the  projected)  surface  area  of  the  facets  spanned  by  dA 
which  have  normals  lying  within  solid  angle  dQ„  (Maxwell  and  Weiner,  1974).  Clearly  this 
density  function  must  be  normalized  such  that 

tc/2 

iK  J  S(0„ )  COS  0„  sin  0„^/0„  =  1 .  (8) 


0 

Torrance  and  Sparrow  took  E  to  be  a  Gaussian  distribution  function  with  zero  mean.  It  can  be 
shown  that  cos2/3  =  cos0,cos0,  +  sin0,sin0,cos(^-^,),  cos0„  =  (cosO,  +  cos0,)/2cos/3,  and 

=  z?Q/4cos/3  using  spherical  geometry.  Observe  that  Eq.  (7)  explicitly  satisfies  Helmholtz 
reciprocity.  It  can  be  rewritten  in  terms  of  experimentally  measurable  quantities  by  noting  that 
E(0„)  is  proportional  to  the  monostatic  BRDF /(0„,O;0„,O).  The  polarization  dependence  is 
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determined  solely  by  the  Fresnel  coefficients;  hence,  in  the  plane  of  incidence  there  are  only  4 
independent  Mueller  matrix  elements— e.g.,  the  1 1, 12, 33,  and  34  terms,  which  can  be  related  to 
the  real  and  imaginary  parts  of  the  2  diagonal  elements,  and  rp(P),  of  the  Jones  scattering 
matrix.  Videen  et  al.  (1992)  have  shown  that  this  simple  polarization  prediction  is  in  quite  good 
agreement  with  experimental  measurements  on  scratched  or  sandblasted  copper  and  aluminum. 
Sung  and  Eberhardt  (1978)  have  improved  on  the  model  by  allowing  the  facets  to  be  slightly 
curved  and  calculating  a  perturbative  correction  using  the  Rayleigh-Rice  method  described  later. 
The  above  BRDF  needs  to  be  multiplied  by  a  geometric  attenuation  factor  G(  i,  h,  r)  to  correct 
for  shadowing  and  masking,  the  former  referring  to  the  partial  illumination  of  a  facet  shadowed 
by  an  adjacent  one  and  the  latter  to  the  partial  visibility  of  a  facet  occluded  by  another.  Here  i, 
n,  and  r  refer  to  unit  vectors  in  the  source,  facet  normal,  and  detector  directions  respectively,  as 
seen  from  the  illuminated  spot;  note  that  2p  =  cos'‘(  i*  f ).  For  simplicity,  Torrance  and  Sparrow 
supposed  that  each  facet  forms  the  side  of  a  symmetric  V-groove  cavity  whose  axes  are  parallel 
to  the  macroscopic  surface  (but  have  random  azimuthal  orientations)  and  whose  upper  edges  all 
lie  in  the  same  plane.  In  that  case,  the  fraction  G  of  a  given  facet  surface  which  contributes  to  the 
reflected  flux  (i.e.,  which  is  both  illuminated  and  visible)  can  be  found  geometrically  in  terms  of 
a  range  of  unobscured  angles  of  the  projections  of  i  and  r  into  the  plane  defined  by  h  and  the 
macroscopic  surface  normal;  the  grooves  are  assumed  to  be  long  enough  that  end  effects  can  be 
neglected.  Although  this  model  for  the  geometric  attenuation  is  very  simple,  it  successfully 
predicts  that  for  large  angles  of  incidence  the  peak  in  the  BRDF  in  the  plane  of  incidence  occurs 
at  an  angle  of  reflection  larger  than  specular.  Such  off-specular  peaks  are  seen  experimentally, 
both  for  coarsely  roughened  metals  and  dielectrics. 

Finally,  Torrance  and  Sparrow  assumed  that  the  multiply  reflected  light  (and  any  volume 
scattering)  was  purely  diffused.  A  purely  diffuse  (Lambertian)  scatterer  has  two  properties.  First, 
its  reflected  radiance  is  independent  of  the  viewing  angle  and  hence  is  equal  to  the 
hemispherical  exitance  H,  divided  by  ;r  (again  not  2;r).  The  reflected  flux  per  unit  solid  angle  is 
proportional  to  cos^^,  which  is  called  Lambert’s  law;  since  the  projected  area  of  a  surface 
element  also  varies  as  cos^^.  this  is  consistent  with  the  fact  that  is  independent  of  the  viewing 

angle.  Equation  (3)  then  implies  that/ is  independent  of  6^  and  and  so  Eq.  (1)  becomes 

/=  Second,  its  directional-hemispherical  reflectance  is  independent  of  the  angle  of 

incidence  and  thus  is  equal  to  the  bihemispherical  reflectance;  this  property  is  a  statement  of 
reciprocity.  The  BRDF  is  then  related  to  the  bihemispherical  reflectance  according  to 
^  ^  P(27C;27C) 

K 

and  is  independent  of  both  the  incident  and  reflected  angles.  If  the  surface  is  ideal  (i.e.,  has  unit 
albedo)  then  the  BRDF  further  simplifies  to  Hn.  Incidentally,  note  that  a  blackbody  has  zero 
albedo  according  to  Eq.  (4)  and  hence  obeys  Lambert’s  law  in  emission;  this  explains  why  the 
sun  appears  as  a  uniformly  bright  disk  even  though  it  actually  is  a  sphere.  It  should  be 
emphasized  that  Eq.  (9)  does  not  imply  that  is  independent  of  the  incident  angle:  on  the 
contrary,  Eq.  (3)  says  that  the  reflected  radiance  due  to  a  well-collimated  incident  beam  of  fixed 
radiance  is  proportional  to  cosdi  because  the  flux  incident  on  a  unit  area  of  the  surface  varies  in 
this  way  (Torrance  and  Sparrow,  1967);  this  means  that  a  spherical  Lambertian  scatterer 
illuminated  from  a  fixed  direction  will  appear  shaded  around  the  edges  (Oren  and  Nayar,  1995), 
not  flat  as  some  authors  have  incorrectly  claimed  (Ellis,  1994),  in  striking  contrast  to  a  spherical 
Lambertian  emitter.  It  should  also  be  borne  in  mind  that  many  rough  objects,  such  as  an  ordinary 
piece  of  paper,  are  reasonably  Lambertian  at  near-normal  viewing  angles,  but  become 
increasingly  specular  at  grazing  angles  due  to  the  increase  in  the  first-surface  Fresnel  reflectance 
as  well  as  to  the  compression  of  the  surface  roughness  in  the  viewing  direction — ^this  latter  effect 
gives  rise,  for  example,  to  the  horizontally  streamed  reflections  commonly  seen  off  waxed  tile 
floors  (Shirley  et  al,  1997).  Finally,  note  that  by  substituting  Eq.  (5)  or  (9)  into  (1)  or  (2),  the 
directional-hemispherical  reflectances  of  an  ideal  mirror  and  of  an  ideal  diffuser  are  found  to  be 
equal,  and  likewise  for  the  hemispherical-directional  reflectances  (Judd,  1967). 
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Multiple  scattering  from  the  surface  or  volume  of  an  object  tends  to  depolarize  the  incident 
light;  even  two  reflections  from  a  valley  can  cross  sorp  polarization,  as  nicely  illustrated  in  Fig. 
14  of  O’Donnell  and  Mendez  (1987).  For  this  reason,  the  light  scattered  by  a  Lambertian  surface 
is  typically  assumed  to  have  random  polarization.  However,  this  is  not  necessarily  true,  because 
the  two  defining  properties  which  led  to  Eq.  (9)  do  not  explicitly  require  multiple  scattering.  For 
example,  one  could  imagine  choosing  the  density  function  in  Eq.  (7)  to  give  a  BRDF  that  is  at 
least  approximately  independent  of  incident  and  reflected  angles,  thus  characterizing  a 
Lambertian  diffuser,  although  only  singly  scattered,  specular  reflections  from  isotropic  facets  are 
involved  which  consequently  cannot  depolarize  sorp  light.  Even  volume  scattering  from  a  paint 
does  not  fully  randomize  the  polarization  of  the  radiation,  as  Ellis  (1996b)  has  demonstrated 
experimentally.  Therefore,  one  should  be  careful  about  assuming  that  diffusers  always  reduce  the 
degree  of  polarization  of  a  beam  to  nearly  zero,  in  the  absence  of  detailed  knowledge  of  the 
scattering  processes. 

Oren  and  Nayar  (1995)  have  developed  an  alternative  form  of  Torrance  and  Sparrow’s  model 
in  which  each  facet  is  assumed  to  reflect  purely  diffusely  rather  than  specularly.  In  effect,  this 
presumes  that  the  surface  has  two  distinct  roughness  scales  and  hence  is  said  to  be  compositely 
rough  (Barrick,  1970):  a  coarse  range  specified  by  macroscopically  flat  facets  whose  dimensions 
are  large  compared  to  the  wavelength  of  the  light,  and  a  fine  range  responsible  for  Lambertian 
scattering  from  each  facet.  [A  very  different  two-roughness-scale  model  has  been  developed  by 
Leader  (1979).]  Their  model  was  developed  to  describe  the  reflectances  of  materials  such  as 
plaster,  sandpaper,  and  cloth  for  computer  rendering  applications,  and  they  performed 
experimental  measurements  which  gave  fairly  good  agreement  with  the  cdculations.  Although 
one  might  suppose  that  the  overall  effect  of  the  two  roughness  scales  in  the  model  is  merely  to 
forther  roughen  the  surface  and  hence  leave  the  overall  scattering  approximately  Lambertian,  this 
is  incorrect:  the  reflectance  is  markedly  non-Lambertian  and  in  fact  has  a  strong  backscattering 
peak  for  large  angles  of  incidence  which  gets  cut  off  for  reflection  angles  beyond  the  source 
direction  (i.e.,  6^  <  -0,).  Notice  that  this  is  completely  opposite  to  the  strong  forward  scattering 
peak  predicted  by  the  Torrance  and  Sparrow  model. 

The  cause  of  the  backscattering  peak  is  reminiscent  of  Hapke’s  (1963)  shadow-hiding 
explanation  for  the  opposition  effect  of  the  moon,  wherein  its  brightness  peaks  at  full  moon  when 
the  sun  is  directly  behind  the  earth.  In  the  lunar  case,  the  relevant  scattering  is  volumetric, 
because  the  dust  on  the  moon  is  very  porous  and  is  taken  to  be  an  open  network  of  particles. 
Particles  closer  to  the  surface  cast  shadows  down  on  the  lower  particles,  which  thus  reduces  the 
reflected  radiance  in  every  direction  except  the  incident  one  where  the  shadows  are  hidden  by  the 
particles  that  created  them.  To  put  it  another  way,  the  light  will  be  attenuated  both  as  it 
penetrates  the  medium  and  as  it  leaves  it  after  being  reflected,  unless  it  backscatters  into  the 
incident  direction  in  which  case  it  can  retrace  its  path  out  and  escape  without  being  blocked.  It  is 
assumed  that  the  albedo  p  of  the  individual  particles  is  small  enough  that  multiple  scattering  is 
negligible  and  that  the  particles  are  large  enough  that  diffraction  around  their  edges  is  minimal, 
since  either  of  these  would  tend  to  wash  out  the  effect.  After  making  a  number  of  simplifying 
approximations,  the  BRDF  of  the  medium  can  then  be  expressed  as  the  product  of  three  factors, 
each  of  which  satisfies  Helmholtz  reciprocity. 


/  = 


1 


COS0..  -1-COS0, 


Rs(P,g)SiP). 


(10) 


The  first  factor  is  the  well-known  Lommel-Seeliger  scattering  term  arising  from  the  total 
dist^ce  traveled  through  the  attenuating  medium  in  the  course  of  reflecting  from  a  typical 
particle.  The  second  term  is  called  the  retroreflection  function  and  decreases  monotonically  in 
value  from  2  for  2p  =  0  (backscattering)  to  1  for  2p  >  itl2.  It  depends  on  a  constant,  g,  of  the 
order  of  unity,  which  is  called  the  compaction  parameter  because  it  is  related  to  the  fractional 
volume  of  the  medium  occupied  by  the  particles.  The  last  factor  is  the  scatter  function,  defined  as 
the  ratio  of  the  scattered  power  per  unit  solid  angle  to  the  power  incident  on  a  particle  by  a  plane 
wave,  averaged  over  all  orientations  and  shapes  of  the  particles.  Taking  the  incident  plane  wave 
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to  be  of  unit  intensity,  this  implies 


(11) 


AJQJ 

where  is  the  cross-sectional  area  of  the  particle  (i.e.,  the  projected  area  of  the  particle  onto  an 
incident  wave  front).  Notice  that  S(P)  is  approximately  equal  to  the  ensemble  average  of  the 
cosine-corrected  B^F  (Stover,  1990)  of  the  particles,  dPJ{dPf  the  equality  becomes  exact 
for  pancake-shaped  (i.e.,  planar)  particles.  The  scatter  function  is  written  by  Hapke  as 
S{P)  =  p  Z(j3),  where  Z(^  has  been  normalized  such  that 


kI2 


(12) 


0 

Both  Si^P)  and  X()3),  Just  like/,  have  units  of  sr'‘.  For  example,  5(j3)  =  l/4;r  describes  isotropic 
scattering  from  smooth,  infinitely  conducting  (i.e.,  ideally  specular)  spheroids — if  the 
conductivity  is  not  infinite,  then  Eq.  (6)  implies  that  there  will  be  a  broad  forward  scattering  peak 
(Van  de  Hulst,  1957)  since  the  Fresnel  reflectances  only  become  equal  to  unity  at  grazing 
angles — ^while  E()3)  =  2[sin(2^  +  (;r-2)^cos(2)3)]/3;F  describes  backscattering  from  Lambertian 
spheres,  as  first  calculated  by  Schonberg.  As  can  be  seen  from  Hapke  Fig.  6,  substituting  this 
latter  scattering  function  into  Eq.  (10)  results  in  a  strong  backscattering  peak.  This  is  also  the 
explanation  for  the  glory  seen  when  looking  down  on  clouds  from  an  airplane  with  the  sun  at 
one’s  back  and  for  the  hot  spots  which  appear  on  vegetation  in  remote  sensing. 

Experimental  retroreflectance  measurements  of  MgCOj,  BaS04,  sulfur,  and  white,  red,  blue, 
and  black  Nextel  paints  were  later  performed  by  Egan  and  Hilgeman  (1976)  using  a  cube 
beamsplitter,  which  was  cleverly  followed  (rather  than  preceded)  by  a  chopper  so  as  to  reject 
light  scattered  by  the  room  or  the  prism.  Both  a  bandpass-filtered  tungsten  iodide  lamp  and  a 
632.8-nm  HeNe  laser  were  used  as  sources.  In  all  cases  an  opposition  effect  was  observed  that 
could  reasonably  be  described  by  Eq.  (10),  except  for  the  blue  and  black  paints  under  laser 
illumination  which  exhibited  an  anomalously  strong  backscattering  peak  attributed  to 
interference  effects.  Presciently,  the  opposition  effect  of  the  moon  is  now  known  to  result  not 
from  shadow  hiding  but  instead  from  coherent  backscatter  (Hapke  et  al,  1993).  This 
phenomenon,  related  to  weak  localization,  arises  as  follows  (Wolf  and  Maret,  1985).  Suppose 
that  a  wave  of  incident  propagation  vector  Iq,  experiences  m  elastic  scattering  events,  where 
m  >  2.  Let  k,  denote  the  propagation  vector  after  the  i*  event,  so  that  k„  points  in  the  direction  of 
observation.  In  the  case  of  backscattering,  k„  =  -Iq,,  and  hence  some  of  the  incident  field  can  also 
follow  the  time-reversed  path  -k„  — >  -k„,_,  -Iq,.  The  phase  difference  between  these  two 

paths  is  obviously  zero  and  thus  they  will  interfere  constructively,  giving  double  the  signal 
calculated  for  the  otherwise  incoherent  addition  of  intensities.  Convincing  evidence  that  this 
enhancement  effect  is  responsible  for  lunar  backscattering  was  provided  by  examining  the 
circular  polarization  ratio  of  scattered  radiation  from  soil  samples  with  incident  circular  laser 
light.  Shadow  hiding  involves  primarily  single  scattering  so  that  the  helicity  should  be  reversed, 
while  coherent  backscatter  involves  multiple  scatterings,  many  of  which  are  into  the  forward 
direction,  and  hence  the  original  polarization  should  be  partially  preserved,  and  this  is  what  is  in 
fact  observed.  The  angular  width  of  this  coherent  backscatter  pe^  is  supposed  to  approximately 
equal  A//,  where  I  is  the  transport  mean  free  path  for  photons  in  the  medium;  for  strongly 
absorbing  particles  as  in  the  case  of  the  moon,  /  is  roughly  equal  to  the  average  spacing  between 
scatterers.  Analysis  of  the  data  implies  Z  =  1  pm,  in  contrast  to  the  40  jim  mean  particle  size  in 
the  lunar  dust;  hence,  the  scatterers  must  be  small  asperities  on  the  grain  surfaces,  rather  than  the 
particles  themselves.  This  is  consistent  with  the  observation  of  coherent  backscatter  from 
wavelength-sized  roughness  on  well-characterized  surfaces  (O’Donnell  and  Mendez,  1987). 

Returning  to  Oren  and  Nayar’s  model,  suppose  that  a  V-cavity  is  illuminated  from  the  right. 
Then  the  left  facet  will  be  brighter  than  the  right  one  because  it  receives  more  light.  Viewed  from 
the  left,  an  observed  sees  principally  the  darker  right  facet  and  comparatively  litde  of  the 
foreshortened  left  facet.  But  as  he  moves  toward  the  source  direction,  the  fraction  of  the  brighter 
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area  increases  while  that  of  the  darker  decreases  and  hence  the  BRDF  increases  in  the 
backscattering  direction  and  is  inherently  non-Lambertian.  Specifically,  the  BRDF  for  a  single 
Lambertian  facet  whose  normal  is  inclined  at  polar  angle  d„  and  azimuthal  angle  relative  to  the 
macroscopic  surface  normal  is 

/  =  -^cos^Jl  +  tan  6/,.  tan6/„  cos(0,.  -  0J][1  +  tan6>,  tan6>„  cos(0,  -  0J],  (13) 

where  p  i§  the  albedo  of  the  facet.  This  expression  is  then  multiplied  by  a  geometric  attenuation 
factor  G(  i,  n, f )  to  account  for  shadowing  and  masking,  and  the  result  is  averaged  over  <}>„ 
assumng  a  uniform  distribution  of  azimuthal  orientations  of  the  V-grooves.  Finally,  that  is 
multiplied  by  the  density  function  of  facet  normals  H(0„)  and  integrated  with  respect  to  sin6„dd 
over  the  hemisphere.  Oren  and  Nayar  chose  s,(d„)  cosO„  to  be  a  Gaussian  with  zero  mean,  which 
differs  slightly  from  Torrance  and  Sparrow  (1967)  or  Maxwell  and  Weiner  (1974)  who  chose 
^{d„)  itself  to  be  such.  Finally,  two-bounce  interreflections  were  taken  into  account  by  „ 
integrating  over  all  positions  on  the  two  faces  of  a  V-groove  which  connect  directions  i  and  f 
geometrically,  where  shadowing  and  masking  determine  the  limits  of  integration,  and  finally 
again  integrating  over  and  d„.  The  singly  scattered  and  doubly  scattered  BRDFs  were  then 
added  to  give  the  total  BRDF,  which  exhibits  reciprocity.  Some  of  the  required  integrals  are 
fairly  complicated  and  were  evaluated  numerically  or  approximated  functionally.  The  resulting 
BRDF  is  nearly  Lambertian  for  small  angles  of  incidence,  wherein  both  facets  of  every  V-cavity 
have  sirmlar  irradiance,  as  well  as  for  azimuthal  angles  90°  out  of  the  plane  of  incidence  since 
the  relative  irradiance  of  both  facets  is  then  approximately  constant  with  respect  to  the  polar 
angle  of  reflection.  But  there  is  a  strong  backscattering  peak  when  the  source  and  viewing 
directions  coincide,  as  well  as  an  interreflection  enhancement  in  the  forward  direction,  and  the 
rendered  image  of  a  curved  surface  illuminated  from  the  viewer  direction  can  be  maHp  very  flat, 
mimicking  the  appearance  of  the  moon  or  of  a  photographed  clay  vase  for  example.  Of  course 
the  model  reduces  to  pure  Lambertian  scattering  if  the  roughness  (as  parametrized  by  the 
Gaussian  standard  deviation  in  the  facet  density  function)  is  set  equal  to  zero. 

The  facets  in  Torrance  and  Sparrow’s  or  Oren  and  Nayar’s  model  are  assumed  to  be  large 
compared  to  the  wavelength  X  of  the  incident  light.  More  specifically,  any  non-composite, 
isotropic  surface  (or  any  individual  component  of  a  composite  surface)  can  be  characterized  by 
two  roughness  sc^es — an  out-of-plane  length  quantified  by  the  rms  surface  height  variation  cr 
and  an  in-plane  distance  parametrized  by  the  correlation  length  /.  In  the  facet  models,  both  of 
Aese  parameters  (for  the  coarse  component  in  Oren  and  Nayar’s  case)  must  be  much  larger  than 
A.  Mathematically  (Bennett  and  Porteus,  1961),  the  surface  height  profile  is  taken  to  be  z  =  C(x  v) 
with  the  zero  level  set  equal  to  the  mean, 

j  YI2  XI2 

1  fC(x,y)dxdy  =  0,  (14) 

-y/2  -x/2 

where  the  sample  has  been  taken  to  be  infnite  in  area,  macroscopically  spanning  the  xy-plane, 
for  simplicity.  With  these  definitions,  the  mean  square  surface  height  is  given  bv  the  variance' 

I  ri2  XI2  o  o  j  , 

>=  X  w  /  iCix,y)dxdy,  (15) 

-y/2  -‘Xti 

and  the  autocorrelation  (or  autocovariance)  function  is  (Hecht,  1998) 

j  Y/2  X/2 

A(5’0  =  ^lim^—  I  l^ix,yXix-s,y-t)dxdy,  (16) 

-K/2  -X/2 

so  that  A(0,0)  =  For  an  isotropic  surface,  A(s,t)  =  A(t,s)  so  that  the  correlation  function  is  only 
a  function  of  a  single  variable  t=(s^  +  known  as  the  lag;  the  correlation  length  /  is  defined 
as  that  lag  for  which  A  diminishes  to  l/e  of  its  peak  value  (i.e.,  to  a^/e).  There  are  two  commonly 
used  statistical  distributions  for  rough  surfaces  (Barrick,  1970).  The  first  assumes  Gaussian 
correlations. 
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(17) 


for  which  the  rms  slope  of  the  surface  can  be  shown  to  be 


(which  equals  the  rms  value  of  iz.n6^  in  the  facet  models  and  hence  is  determined  in  turn  by  the 
density  function).  The  other  supposes  the  surface  height  autocorrelation  function  to  be 
exponential, 

=  (19) 


in  which  case  m  turns  out  to  be  undefined,  because  such  surfaces  are  jagged  with  many  vertical 
facets,  describing,  as  an  example  in  remote  sensing,  an  urban  area  including  buildings. 

The  requirement  that  CT»  A,  as  well  as  the  naive  model  of  V-grooves  having  coplanar  top 
edges,  can  be  relaxed  by  invoking  wave  rather  than  geometrical  optics.  Nevertheless,  many 
simplifying  assumptions  remain  in  order  to  make  the  physics  tractable;  the  general  problem  of 
optical  scattering  by  an  arbitrarily  specified  surface  has  not  been  solved,  even  without 
considering  volumetric  scattering  of  the  portion  of  the  beam  transmitted  below  the  first  surface. 
Briefly,  the  goal  is  to  find  the  reflected  electromagnetic  field  for  some  known  incident  field  and 
characterized  surface  by  satisfying  the  required  boundary  conditions.  Typically  shadowing, 
masking,  multiple  reflections,  and  surface  waves  are  neglected.  One  approach  has  been 
developed  in  great  detail  by  the  radar  scattering  community  (Beckmann  and  Spizzichino,  1987); 
note  that  the  (differential)  radar  cross  section  cr  (having  units  of  area)  is  related  to  the  BRDF  by 
/=  alAjcAcosdfosd^  where  A  is  the  (macroscopic)  illuminated  surface  area  of  the  sample  (Ruck 
et  al,  1970).  O’Donnell  and  Mendez  (1987)  have  verified  that  this  theory  is  in  good  agreement 
with  optical  measurements  on  appropriately  fabricated  samples. 

In  detail.  Maxwell’s  equations  imply  that  the  electric  field  satisfies  a  wave  equation,  and  if 
the  time  dependence  is  separated  out  (Fourier  analyzing  the  constituent  frequencies  if  necessary), 
the  Helmholtz  equation  is  obtained  for  any  scalar  component  of  the  field  (typically  the  incident 
and  reflected  radiation  are  decomposed  into  s  and  p  polarizations).  Green’s  theorem  can  be  used 
to  recast  this  differential  equation  as  the  Kirchhoff  integral,  which  expresses  the  field  anywhere 
in  space  in  terms  of  the  electric  field  and  its  gradient  at  every  point  on  any  closed  surface 
enclosing  the  spatial  point  of  interest.  In  the  present  application,  the  surface  is  taken  to  be  that  of 
the  object  together  with  an  enclosing  hemisphere  at  infinity,  with  the  latter  giving  zero 
contribution  to  the  result.  In  accord  with  Huygens’  principle,  each  point  on  the  material  surface 
is  taken  to  be  the  source  of  a  spherical  wave,  and  the  Fraunhofer  far-field  limit  is  considered,  in 
which  the  incident  and  scattered  beams  are  taken  to  be  plane  waves.  Finally,  the  Kirchhoff 
approximation  is  invoked  to  obtain  the  electric  field  and  its  normal  derivative  on  the  reflecting 
surface,  by  assuming  that  the  scattering  from  any  point  on  the  surface  is  described  by  the  Fresnel 
amplitude  reflection  coefficients,  r  in  Eq.  (6),  from  a  smooth  plane  tangent  to  that  point.  This  is 
the  most  serious  restriction  of  the  validity  of  this  approach,  as  it  obviously  requires  that  the  radii 
of  curvature  of  any  surface  irregularities  be  large  compared  to  the  wavelength,  or  equivalently, 
that  / »  A,  implying  a  gently  rolling  surface.  A  formal  integral  solution  to  the  problem  of 
scattering  by  a  rough  surface  of  finite  conductivity  is  then  obtained.  Unfortunately  it  is  too 
complicated  to  evaluate  for  real  situations  of  interest  unless  the  relevant  amplitude  reflection 
coefficient,  which  depends  on  the  angle  of  incidence  relative  to  the  local  surface  normal,  is 
constant  across  the  surface.  In  practice,  this  occurs  either  because  the  conductivity  can  be  taken 
to  be  infinite  or  because  the  surface  roughness  is  sufficiently  small  that  we  can  replace  the 
reflection  coefficient  by  its  average  value.  For  the  simpler,  perfectly  conducting  case,  integration 
by  parts  gives  the  scattered  field  in  terms  of  a  Fourier  transform  over  the  area  A  of  the  isotropic 
surface. 


(20) 


= F(.e,.e,4, 

^r,smoorh  ^  a 

neglecting  edge  effects,  where  k,  and  k,.  are  the  incident  and  reflected  propagation  vectors, 
respectively,  with  magnitude  k  =  ItjJX,  is  the  field  which  would  be  diffracted  into  the 

specular  direction  if  the  finite-sized,  infinitely-conducting  surface  were  smooth  and  the  incident 
wave  were  5-polarized,  F  =  [1  -i-  cos6,cosd,  +  smdiSm6,cos(^-<l>,)]/[cos6i  (cosdj  +  cos0,.)],  and 
=  (x,y,^  is  a  point  on  the  surface.Accounting  for  the  Huygens-Fresnel  Principle  (cf.  Sec. 

10.3.1  of  Hecht,  1998),  the  projected  area  of  the  sample  (A  cos0,),  and  the  solid  angle  subtended 
at  the  detector  (dQ^),  I  find  that  the  BRDF  is  given  by 

(21) 


Acosd- 
A' cose, 


which  satisfies  Helmholtz  reciprocity.  For  a  statistically  rough  surface,  Eq.  (21)  needs  to  be 
averaged  by  multiplying  it  by  the  normalized  height  distribution  p{Q  and  integrating  over  all 
The  usual  assumption  is  that  this  distribution  is  Gaussian,  i.e.. 


P(0  = 


C 

(22) 


Notice  that  Eq.  (21)  will  involve  a  quadruple  integral  (over  say  dAdA)  which  can  be  recast  in 
terms  of  the  autocorrelation  function.  Assuming  that  this  is  also  Gaussian  and  given  by  Eq.  (17), 
a  rather  complicated  final  expression  for  the  scattering  results. 

Two  special  cases  are  of  most  interest,  however.  For  a  slightly  rough  surface  (cr«  A),  the 
scattering  is  predominantly  specular  and  the  directional-conical  reflectance  is  approximately 


(23) 


where  spans  the  specular  lobe  centered  about  0,  =  6,  and  =  <^,-¥  it  and  is  roughly  equal  to 
AVA  due  to  diffraction  from  the  finite-sized  surface  A  (which  is  determined,  for  example,  by  the 
usual  product  of  sine-squared  functions  for  a  rectangular  surface).  Equation  (23)  is  a  well-known 
result  nicely  derived  from  the  Fraunhofer  diffraction  formula  by  Davies  (1954).  Bennett  and 
Porteus  (1961)  have  experimentally  verified  that  it  gives  the  specular  reflectance,  at  sufficiently 
long  wavelengths,  of  a  slightly  rough  surface  relative  to  a  smooth  surface  of  the  same  material. 
Good  agreement  was  obtained  with  measurements  in  the  mid-infrared  for  glass  disks  roughened 
with  ~10-|im  grit,  overcoated  with  aluminum,  and  characterized  by  stylus  profilometry.  Normal 
incidence  was  employed,  thus  minimizing  effects  such  as  shadowing  and  any  polarization 
dependence.  These  measurements  can  also  be  described  in  terms  of  the  total  integrated  scatter 
(TIS),  defined  as  the  ratio  of  the  diffusely  to  the  specularly  scattered  powers.  Writing  the 
specularly,  diffusely,  and  total  scattered  powers  as  and  respectively, 

one  sees  for  a  slightly  rough  surface  that 

Tis  =  :^LZ^  =  lZ^„(2A:<TcosF,.f,  (24) 

^spec  Pspec 


where  the  second  equality  presumes  that  slight  roughening  merely  redistributes  the  total  power 
scattered  by  a  smooth  sample  without  diminishing  it,  as  follows  from  the  assumption  of  infinite 
conductivity.  Note  that  to  the  level  of  the  approximations  used  above,  the  TIS  is  also  equal  to 
1  -  (PspJPJ  =  1  -  iPspeJPsvec.smoo,h\  whore,  recalling  Eq.  (5),  P,p,c.smoo,h  =  is  the  specularly 

reflected  power  by  a  smooth  surface  of  the  same  material,  thus  explaining  other  definitions  of  the 
TIS  used  in  the  literature  (e.g.,  Elson  and  Bennett,  1979b).  Church  et  al  (1977)  have  shown, 
from  the  Rayleigh-Rice  perturbation  theory  discussed  below,  that  Eq.  (24)  holds  regardless  of  the 
specific  form  chosen  for  the  height  distribution  function  p(^  in  Eq.  (22)  and  hence  is  not 
dependent  on  the  assumption  of  Gaussian  statistics — see,  for  example,  the  discussion  in 
connection  with  Eq.  (34)  below.  A  particularly  nice  feature  of  TIS  measurements  is  that  no 
reference  sample  is  necessary:  the  diffusely  reflected  power  can  be  measured  using  an 
integrating  sphere  and  one  detector,  while  a  small  hole  (coincident  with  the  input  hole  for  normal 
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incidence)  and  a  second  detector  is  used  to  measure  the  specularly  reflected  power  (Stover, 

1990).  Also  note  that  Eq.  (24)  provides  a  quantitative  basis  for  the  Rayleigh  criterion  for  the 
roughness  of  a  sample,  wherein  a  surface  is  respectively  considered  rough  or  smooth  if 
4;rc5Cos0/A,  which  equals  the  phase  difference  between  two  specular  rays  reflected  from  points 
separated  by  a  height  tron  the  surface,  is  large  or  small  compared  to  jdl  or  approximately  1 
(Beckmann  and  Spizzichino,  1987). 

Equation  (23),  giving  the  specular  reflectance  of  a  slightly  rough  surface,  is  in  fact 
proportional  to  l<£,>P.  However,  the  total  scattered  intensity  is  more  exactly  given  by 
<\Ejh>  =  \<E;>\^  +  <\E,  -  <E;>\h>.  The  angular  distribution  of  the  diffusely  scattered  radiation  is 
determined  by  the  second  term  on  the  right-hand  side  of  this  equality,  which  defines  the  variance 
of  the  complex  scalar  amplitude  E„  given  in  turn  by  the  sum  of  the  vm^ces  of  its  real  and 
imaginary  parts.  In  general,  this  depends  upon  the  joint  probability  distribution  of  finding  the 
height  to  be  at  a  lag  raway  from  a  point  where  the  height  is  Ci-  However,  for  a  slightly  rough 
surface  the  integral  of  interest  can  be  expanded  to  first  order  in  QX  and  alX,  with  the  result  that 
the  variance  depends  only  on  the  autocorrelation  function  of  Eq.  (17).  The  diffuse  BRDF  is  then 
approximately  found  to  be  (Beckmann  and  Spizzichino,  1987) 

_  [1  +  cos  0,  cos  6^  +  sin  6/  sin  9^  cos(0,  -  )]^  ^ 

An  cos  0.  cos  0^  (25) 


ex^[-k'^P  { sin^  0,.  +  sin^  0^  +  2  sin  0,.  sin  0^  cos(^^  -  /  4] . 

Similar  results  have  been  obtained  by  Davies  (1954)  and  Leader  (1979)  except  for  the  angul^ 
prefactor— as  discussed  by  Stover  (1990),  as  well  as  in  a  footnote  on  page  75  and  in  Appendix  A 
of  Beckmann  and  Spizzichino  (1987),  the  form  of  this  angular  term  depends  on  the  obliquity 
factor  used,  of  which  considerable  variation  exists  in  the  literature. 

The  other  important  case  of  Eq.  (21)  averaged  using  Eq.  (22)  is  very  rough  scattering 
((T»  X),  for  which  the  reflection  is  predominantly  difftise  and  given  by  the  BRDF 
^  1  [1  +  cos  0,  cos  0^  +  sin  0,  sin  0^  cos(0,  -  )]^ 

j  ^  "  .1—1.111  .-.III  - . .  X 

nrn^  cos  0,  cos  0^  (cos  0,  +  cos  0^ ) 

(26) 


sin^  0,  +  sin^  0^  +  2  sin  0,  sin  0,  cos(0^  -  ) 
(cos  0,  +  cos  0^  )^ 


where,  as  defined  in  Eq.  (18),  m  is  the  rms  surface  slope.  In  Davies  (1954),  the  argument  of  the 
exponential  is  larger  by  a  factor  of  2,  apparently  stemming  from  a  different  assumed  form  for  the 
surface-height  joint  probability  distribution. 

It  is  emphasized  again  that  the  tangent-plane  approximation  inherent  m  the  above  physical- 
optics  method  implies  that  the  solution  will  only  be  correct  in  the  high-optical-frequency  limit 
(X  « 1)  and  in  this  respect  is  not  more  accurate  than  geometrical  optics,  although  it  can  be 
applied  to  a  wider  variety  of  target  geometries.  A  perturbation  approach  can  be  used  to 
approximate  the  scattering  at  lower  frequencies  {X  <  /;  obviously  at  very  low  frequencies  a 
rough  surface  will  ultimately  appear  smooth  and  scatter  specularly)  and  has  the  added  advantage 
that  it  is  explicitly  vector  based.  However,  the  relevant  perturbation  is  of  the  surface  height  and 
hence  the  method  only  applies  to  slightly  rough  samples  (cr«  X).  The  basic  idea  was  first 
proposed  by  Rayleigh  in  1895  and  later  extended  by  Rice  (1951),  so  that  it  is  today  known  as  the 
Rayleigh-Rice  method.  The  height  profile  ^(x,y)  is  Fourier  analyzed  into  its  sinusoidal 
components  Z(k^ky).  Any  individual  component  acts  like  a  periodic  grating,  scattering  an 
incident  plane  wave  into  a  set  of  directions  given  by  the  2D  grating  equations.  Assuming  the 
surface  is  only  slightly  rough,  the  diffuse  scattering  will  be  dominated  by  the  first-order  peaks,  so 


that 


(k,  -  k,. )  •  X  =  k(sm  0^  cos  (j)^  -  sin  0,- )  =k^ 
(k^  -  k,.)  •  y  =  ksin  0^  sin0^  =ky 


(27) 
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where  and  k,  can  be  either  positive  or  negative  so  as  to  include  all  four  first-order  peaks  and 
where  I  have  followed  the  usual  convention  (e.g.,  Beckmann  and  Spizzichino.  1987;  Church  and 
Zavada,  1975)  of  onentmg  the  x-axis  so  that  =  -n.  Equation  (27)  can  be  viewed  as  an 
expression  of  conservation  of  linear  momentum.  The  total  electric  field  above  the  surface  (which 
for  simplicity  is  taken  to  be  infinitely  conducting)  in  the  case  of  incident  j-polarization  (which 
avoids  coupling  to  surface  waves)  is  then 

k,.ky 

where  the  0  subscripts  on  the  electric  fields  denote  the  complex  amplitudes,  suppressing  the 
unimportant  time  dependences.  The  first  term  on  the  right-hand  side  of  Eq.  (28)  is  the  incident 
field;  the  second  is  the  specularly  reflected  field,  whose  amplitude  is  equal  to  that  of  the  incident 
field  to  lowest  order  m  the  roughness  and  whose  sign  arises  from  the  180°  phase  shift  for 
external  TE  reflection  from  Eq.  (6a);  and  the  last  term  is  the  diffusely  reflected  field,  given  bv 
the  sum  of  the  fost-order  diffraction  peaks.  (Note  that  Rice  writes  =  mK  and  k,  =  nK,  where  K 
IS  some  arbitrarily  small  fundamental  spatial  frequency,  which  in  practice  can  be  chosen  by 
requinng  2;z/Ar  to  be  of  the  order  of  the  largest  experimentally  observable  spatial  wavelength 
namely  the  s^ple  or  beam  diameter.  Furthermore,  there  are  upper  limits  on  the  values  of  k  and 
ky,  beyond  wmch  the  diffracted  orders  disappear  into  the  surface;  for  example,  one  must  have 
(kj  4-  ^2 )'/2<  k  at  normal  incidence,  i.e.,  A  <  /.  Thus,  the  above  summation  can  actually  be 
considered  to  be  over  a  finite  set  of  integers  m  and  n.)  Maxwell’s  equation  V*E  =  0  requires  that 
each  component  wave  in  the  scattered  field  be  transverse,  kr'Eo^  =  0.  Furthermore, 
electromagnetic  boundary  conditions  must  be  imposed  at  the  surface.  For  example  the  two 
tangential  components  of  the  total  electric  field  must  vanish  on  the  surface,  rixEI  /-  =  0  where  n 
is  the  local  surface  normal  given  by  ’ 

(-dC/dx,-d^/dy,l) 

^jl  +  (^^/^x)^  +  (d^/dy)^ 

or  approximately  (-3^/9x,-^^/9y,l)  using  the  perturbation  assumption  (which  implies  the  surface 
slopes  are  gentle,  again  consistent  with  A  <  /).  These  first  derivatives  of  the  surface  profile  are 

Founer  transforms  of  k^(k^,k,)  and  k,Z(k^,k,).  A  matrix  equation  is  thus  obtained  for 
he  C^esian  components  of  the  unknown  amplitudes  Eo„  which  (as  for  n)  are  expanded  to 
lowest  nonzero  order  m  C  The  statistical  character  of  the  surface  profile  is  now  introduced  by 
ssuming  that  is  distnbuted  normally  about  zero  and  that  the  variables  k  and  k  are 

powL^spStr^deS^^^^  ensemble  average  of  the  squared  modulus  of  Z(k;k,)  defines  the 


W(k,,ky)  =  1/ 


sample.  (In  the  present  context,  the  1/A  prefactor  normalizes  the 
scattered  power  to  the  incident  power.)  According  to  the  Wiener-Khinchin  relation,  Eq.  (30)  can 
^so  be  expressed  as  the  Fourier  transform  of  the  autocorrelation  function.  In  polar  coordinates 
the  ^imuthal  integration  yields  a  Bessel  function  of  order  zero  for  an  isotropic  surface  (Elson  ’ 
^d  Bennett,  1979b),  so  that  Wis  only  a  function  of  {k}+k^yn.  Specifically,  assuming  a 
Gaussian  autocorrelation  function  as  given  by  Eq.  (17),  the  radial  integration  can  also  4 
periormed,  giving 

W^(k„k,)  = 

while  for  the  exponential  surface  height  correlations  of  Eq.  (19),  one  obtains 

iy(k„k,)  =  - - 

'  [l  +  (kl+kj)f] 

which  becomes  a  Lorentzian  in  the  case  of  ID  roughness,  C(x).  Note  that,  strictly  speaking, 
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Eq.  (32)  cannot  be  used  in  the  present  context  because  it  implies  (infinitely)  steep  surface  slopes, 
as  discussed  in  connection  with  Eq.  (19);  however,  many  researchers  have  ignored  this  point  and 
used  it  anyway,  implicitly  assuming  that  the  correlation  function  eventually  becomes  non¬ 
exponential  for  lags  below  the  range  of  measurement  (which  is  on  the  order  of  X  in  optical 
experiments  and  of  the  radius  of  the  stylus  when  using  mechanical  profilometry).  Specifically, 
the  autocorrelation  function  must  have  zero  slope  at  zero  lag  to  be  physically  meaningful  (Elson 
and  Bennett,  1979a). 

Elson  (1975;  also  see  Elson  and  Ritchie,  1974)  has  performed  an  analysis  which  is  essentially 
equivalent  to  the  preceding  Rayleigh-Rice  vector  theory,  except  that  he  has  used  the  full 
electromagnetic  boundary  conditions  rather  than  assuming  infinite  conductivity.  The  wave 
equations  were  expanded  to  first  order  in  ^(x,y)  and  solved  by  Green  function  methods.  His 
oft-quoted  result  for  the  diffuse  BRDF  is 

fdiff=K- COS0, cosd,  ■  Q •  Wik^,k^.)  (33) 

assuming  that  m  «  1.  If  is  given  by  Eq.  (31),  this  result  reduces  to  Eq.  (25)  provided  that  the 
form  of  Q  applicable  to  an  infinitely  conducting  surface  is  adopted  and  that  the  polarization 
factor  of  Eq.  (36)  is  accounted  for.  Note  that  some  authors  (e.g..  Church  et  ai,  1977)  employ  the 
symmetric  form  of  the  Fourier  transform  in  Eq.  (30),  and  hence  the  first  term  on  the  right 
becomes  4k*  instead.  Here,  k*  is  the  usual  “Rayleigh  blue-sky”  factor;  the  angles  comprise  an 
obliquity  factor  and  have  a  significant  effect  near  grazing;  Q,  which  is  proportional  to  the 
scattering  matrix  elements,  is  called  the  optical  factor  and  depends  on  the  relevant  material  and 
polarization  properties — it  has  been  tabulated  for  a  variety  of  cases  of  interest  by  Barrick  (1970) 
and  Church  et  al.  (1977),  reducing  to  the  Fresnel  reflectances  R  of  Eq.  (6)  for  small-angle  (i.e., 
near  specular)  scattering,  and  should  be  summed  (averaged)  for  a  polarization-insensitive 
detector  (unpolarized  source);  finally,  in  the  present  context  W(k^,ky)  from  Eq.  (30)  is  called  the 
surface  factor  (Elson  and  Bennett,  1979a).  Since  each  Fourier  component  of  the  surface  diffracts 
the  incident  light  into  a  unique  set  of  first-order  directions,  it  makes  intuitive  sense  that  there  is  a 
one-to-one  mapping  between  the  angular  distribution  of  the  scattered  light  (as  given  by  J)  and  the 
power  spectral  density  of  the  roughness  profile  (as  given  by  W).  Specifically,  consider  a  ID 
sinusoidal  grating  given  by  z(x)  =  2‘^<Tsin(/sjc).  Then  the  PSD  is  proportional  to  a  product  of  two 
Dirac  delta  functions, 

W(k„A:,,)  =  iK  WS(k^  ±  K)d(k^.),  (34) 

where  k^  =  ±K  gives  the  two  first-order  peaks.  Using  this  to  evaluate/in  Eq.  (33),  assuming 
small-angle  scattering  [which  by  Eq.  (27)  implies  K/k  «  sin0;,  i.e.,  a  low-spatial-frequency 
grating  and  an  angle  of  incidence  not  too  far  off-normal],  integrating  it  with  respect  to  cos0//n,. 
over  the  outgoing  hemisphere  (implicitly  excluding  the  specular  beam),  and  dividing  that  by 
R(d,)  gives  the  TIS,  which  agrees  with  Eq.  (24).  This  result  can  also  be  written,  to  lowest  order  in 
(T,  as  TIS  =  2p,  where  p  is  c^ed  the  diffraction  efficiency  and  is  defined  as  the  ratio  of  the  power 
diffracted  into  the  +1  and  0  orders  (Stover,  1975;  Church  et  ai,  1977),  with  the  factor  of  2 
arising  from  the  fact  that  the  same  amount  of  power  is  diffracted  into  the  +1  and  -1  orders  in  the 
small-angle-scattering  limit. 

Equation  (33)  has  been  compared  to  experimental  measurements  (Germer  et  ai,  1997)  on  a 
fabricated  silicon  microroughness  standard  having  a  pseudorandom  distribution  of  circular  pits 
(with  ~l-p,m  diameters  and  1-nm  depths)  using  a  cw  doubled-Nd:YAG  laser.  Excellent 
agreement  resulted  for  an  incident  angle  of  0,  =  45°,  reflected  polar  angles  of  6,  =  30°,  45°,  and 
60°  and  azimuthal  angles  of  0^  =  10°-170°,  and  the  four  polarization  combinations  ss,  sp,  ps,  and 
pp.  An  interesting  feature  of  the  data  is  that  the  pp  scattering  vanishes  for  certain  out-of-plane 
angles  (e.g.,  0,.  =  60°  for  6,  =  45°)  which  are  the  bidirectional  analogs  of  the  Brewster  angle. 
Furthermore,  ellipsometry  indicates  that,  for  incident  linearly  polarized  light,  the  out-of-plane 
(lateral)  scattered  light  remains  strongly  linearly  polarized,  albeit  with  a  rotated  plane  of 
vibration  in  general.  Both  of  these  effects  can  be  used  to  probe  other  scattering  mechanisms. 
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such  as  those  due  to  surface  particles  or  bulk  defects  (Germer,  1997),  whose  signals  might 
otherwise  be  masked  by  the  microroughness  surface  scattering. 

The  optical  factor  Q  describes  the  polarization  of  the  reflected  beam  relative  to  that  of  the 
incident  beam  and  can  be  considered  a  generalization  of  the  Fresnel  reflectances  of  Eq.  (6).  In 
particular,  the  cross-polarization  terms  Q,p  and  Qp,  are  zero  for  scattering  into  the  plane  of 
incidence.  For  the  common  case  of  in-plane  (longitudinal)  ss  scattering,  it  is  convenient  to  note 
(Stover,  1990)  that  the  optical  factor  is  simply  the  geometric  mean  of  ^(6,)  and  Rsid^).  If  the 
incident  light  is  polarized  but  neither  s  nor  p  (i.e.,  linearly  polarized  at  an  oblique  angle, 
circularly  polarized,  or  elliptically  polarized),  then  for  both  the  specular  and  diffuse  components 
even  the  singly  scattered  light  is  depolarized.  To  put  this  on  a  more  precise  footing,  the 
polarization  factor  p  of  a  wave  is  defined  as  where  the  complex  amplitude  Eq  has  been 

resolved  into  its  vertical  and  horizontal  components  (Beckmann  and  Spizzichino,  1987).  I  choose 
to  follow  the  sign  convention  wherein  any  scalar  component  of  the  field  is  written  as 


-  F 


(35) 


where  (0-=ck  'm  vacuum,  while  some  authors  choose  the  opposite  signs  for  k»r  and  (Ot. 
Accordingly,  Im{p}  =  0  implies  linear  polarization  (e.g.,  p  =  0  is  horizontal  and  p  =  <»  is 
vertical),  while  Im{p}  greater  or  less  than  0  refers  to  left-handed  or  right-handed  polarization, 
respectively  (e.g.,  p  =  /  is  left  circular  and  p  =  -i  is  right  circular).  If  the  polarization  of  the 
incident  wave  is  described  by  p,  and  that  of  the  reflected  wave  by  p^,  then  the  (complex) 
depolarization  factor  q  is  defined  by  the  relation  =  qpf,  specifically,  q=\  implies  no 
depolarization,  while  ^  =  0  or  <»  corresponds  to  a  filter,  scattering  only  one  component  of  the 
incident  light,  as  occurs  at  the  Brewster  angle  for  example.  [Note  carefully  that  depolarization 
here  refers  to  a  change  in  the  state  of  polarization  of  a  beam,  not  in  its  degree  of  polarization. 
Both  the  incident  and  scattered  waves  are  assumed  to  be  fully  polarized.  However,  many  workers 
(e.g.,  Renau  et  ai,  1967)  use  the  term  to  refer  instead  to  an  increase  in  the  randomly  polarized 
fraction  of  a  beam.]  For  an  ideal  specular  reflector,  ^  =  -1,  so  that  an  infinite,  flat,  perfect 
conductor  will  depolarize  all  incident  polarizations  except  s  and  p,  changing  both  the  orientation 
of  the  polarization  ellipse  and  its  handedness;  this  same  result  holds  for  longitudinal  scattering  by 
a  rough  surface  (of  infinite  conductivity)  if  the  tangent-plane  approximation  holds  (i.e.,  / »  A). 

If  a  smooth  plane  has  finite  conductivity  instead,  then  q  =  Vpid^lrJ^d^  and  one  sees  that  an 
obliquely  linearly  polarized  beam  (i.e.,  neither  s  nor  p)  incident  at  an  oblique  angle  (i.e.,  neither 
normal  to  nor  grazing  the  surface)  gives  rise  to  a  (longitudinal)  specularly  reflected  beam  which 
in  the  case  of  a  dielectric  remains  linearly  polarized  but  with  a  rotated  plane  of  vibration,  and 
which  in  the  case  of  a  metal  is  elliptically  polarized  (Jenkins  and  White,  1957);  in  contrast, 

^  =  -1  at  normal  incidence  and  +1  at  grazing  incidence. 

In  the  case  of  lateral  scattering,  even  incident  5  or  p  light  is  depolarized.  Consider,  for 
example,  a  vertically  polarized  wave  at  near  grazing  incidence  scattering  off  a  perfectly 
conducting  surface  element  tilted  upwards  about  the  jc  axis  by  45°.  Resolving  the  incident  field 
into  tangential  and  normal  components  and  applying  the  boundary  conditions,  the  reflected  wave 
is  found  to  be  horizontally  polarized.  This  effect  is  responsible  for  the  depolarization  of 
television  waves  by  tin  rooves  in  cities.  The  general  result  for  specular  reflection  off  facets  of 
finite  conductivity  is  shown  from  spherical  geometry  (Beckmann  and  Spizzichino,  1987)  to  be 

p,(r,  tan/?,  tan^,  -h  r^) -h (r^  -  r,)tani3, 

r^-nrptanAtanjSj-p.fr^tanjSj-r^tanA)’  ^ 

where 


and 


Pi  =  sin  * 


sin  0^  sin 


|_  -y/l  —  (cos  0.  cos  0^  -  sin  0,  sin  0^  cos  ^^  )^ 
P2  =  cos~‘[cos  Pi  cos<j)^  -  sin  Pi  cos0,  sin^J, 


(36b) 

(36c) 
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with  </>,  =  as  in  Eq.  (27).  It  is  easy  to  check  that  for  longitudinal  scattering  (0,  =  0),  one  gets 
P^=  ^  =  0,so  that  q  =  rjr,  as  above.  These  expressions  are  already  rather  complicated,  without 
even  considering  the  depolarization  of  waves  diffracted  into  non-specular  directions,  as  occurs  in 
the  Rayleigh-Rice  model  for  example.  Note  that  knowledge  of  the  optical  factors  is  not  sufficient 
to  describe  scattering  from  and  to  states  of  arbitrary  polarization,  because  the  four  Q  values 
involve  intensity  rather  than  amplitude  ratios  and  hence  the  necessary  phase  information  is  not 
available.  For  example,  a  beam  linearly  polarized  at  45°  to  the  vertical  and  a  circularly  polarized 
wave  both  have  equal-intensity  s  and  p  components  and  hence  cannot  be  distinguished  on  that 
basis  alone.  Unfortunately,  no  general  analytical  solution  to  the  problem  of  depolarization  by  a 
rough  surface  exists. 

To  circumvent  this  situation  and  the  limitations  inherent  in  the  Kirchhoff  method  (i.e.,  the 
tangent-plane  approximation  l»  X  and  the  assumption  of  infinite  conductivity)  or  in  the 
Rayleigh-Rice  theory  (namely,  the  assumption  of  slight  roughness  a«X  and  gentle  slopes 
X  <  1),  direct  numeric^  solutions  of  the  electromagnetic  equations  have  been  undertaken.  In 
essence,  one  starts  from  a  vector  form  of  the  KirclAoff  integral  (Jackson,  1975) — also  known  as 
the  Chu-Stratton  integrals  (Ruck  et  ai,  1970) — for  the  electric  and  magnetic  fields,  E  and  H, 
which  express  the  total  fields  (or  more  simply,  the  scattered  fields  if  the  incident  fields  are 
subtracted  off)  at  any  point  in  space  in  terms  of  their  values  on  the  sample  surface  and  a  free- 
space  Green  function.  In  turn,  the  surface  fields  can  be  related  to  the  induced  surface  current 
density  J.  Finally,  J  can  be  obtained  from  a  similar  integral  by  relating  the  scattered  fields  on  the 
surface  to  the  incident  fields  via  the  boundary  conditions,  taking  care  to  handle  the  singularity 
when  the  cirgument  of  the  Green  function  is  zero  (i.e.,  when  the  source  point  is  identified  with  the 
field  point).  These  integral  equations  can  also  be  obtained  more  directly  from  the  optical 
extinction  theorem  (Soto-Crespo  and  Nieto-Vesperinas,  1989).  To  solve  them  numerically,  the 
finite  surface  is  chopped  into  a  discrete  number  of  sampling  joints,  say  AT  in  all.  The  elements  of 
surface  area,  sec  6„d)^y,  and  the  surface-normal  unit  vectors  n  (where  n*z  =  cos0„)  are 
determined  by  ^(x,y)  from  Eq.  (29).  The  integrals  expressing  the  boundary  conditions  can  then 
be  written  as  two  NxN  matrix  equations  for  the  N  values  of  E,  and  H,  on  the  surface  in  terms  of 
the  N  values  of  the  two  tangential  components  of  J.  These  equations  are  inverted  to  give  J, 
which  is  inserted  in  turn  into  the  finite  approximations  for  the  integrals  for  E^  and  H^,  completing 
the  solution.  This  is  known  as  the  point-matching  method.  The  N  sampled  v^ues  of  the  surface 
roughness  profile  are  generated  by  a  Monte  Carlo  procedure  constructed  to  give  zero  mean  with 
the  desired  root  variance  trand  autocorrelation  function  A(t).  The  scattered  power  is  proportional 
to  <!£'/>,  where  the  average  is  over  an  ensemble  of  such  sequences  of  sampled  values. 

For  example,  Soto-Crespo  and  Nieto-Vesperinas  (1989)  have  computed  the  scattering  from 
ID  infinitely-conducting  random  (Gaussian-correlated)  or  periodic  surfaces.  For  this  geometry, 
the  Green  function  is  a  zeroth-order  Hankel  function  of  the  first  kind.  Interestingly,  there  is  an 
intermediate  range  of  roughness  (between  the  regimes  of  mostly  specular  scattering  and 
enhanced  backscattering  at  small  and  large  values  of  a/X,  respectively)  for  which  (he  reflection  is 
very  nearly  Lambertian,  at  least  for  near-normal  angles  of  incidence.  Furthermore,these 
researchers  compared  their  numerical  results  to  the  analytical  predictions  of  the  Kirchhoff 
approximation  and  were  thus  able  to  prepare  a  useful  graph  indicating  the  range  of  values  of  all 
versus  UX  for  given  angles  of  incidence  for  which  the  two  integrated  scattered  powers  agree  with 
each  other  to  within  a  specified  percentage.  Saillard  and  Maystre  (1990)  have  considered  ID 
random  surfaces  of  finite  but  large  conductivity,  applicable  for  instance  to  gold  at  infrared  (IR) 
wavelengths.  In  this  case,  the  kernels  of  the  integral  equations  are  sharply  peaked  and  hence 
difficult  to  accurately  evaluate  by  the  above  numerical  procedures.  So  instead,  an  “impedance 
boundary  condition”  is  imposed,  whereby  a  local  linear  relationship  between  the  total  field  in  the 
metal  and  its  normal  derivative  on  the  surface  is  assumed.  Knotts  et  al.  (1993)  have  numerically 
evaluated  the  complete  Mueller  matrix  for  these  same  conditions  and  compared  the  results  to 
experimental  measurements  on  well-defined  seimples,  prepared  by  overcoating  a  photoresist  plate 
exposed  to  a  suitable  ID  laser  pattern.  Qualitative  agreement  resulted  when  Gaussian  statistics 
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were  employed,  but  detailed  agreement  required  that  the  sequences  of  surface-height  values  be 
taken  directly  from  digitized  profilometer  data  rather  than  be  generated  by  the  Monte  Carlo 
technique,  indicating  that  rather  subtle  statistical  properties  of  a  surface  can  play  a  significant 
role  in  determining  the  scattering.  This  emphasizes  the  most  important  limitation  of  such 
numerical  techniques:  they  must  be  repeated  from  scatch  for  each  specific  set  of  sample 
properties  and  illumination  characteristics. 

Experimental  Measurements  of  Surface  BRDFs 

In  this  section,  an  assortment  of  experimental  papers  are  briefly  discussed,  but  no  attempt  at 
completeness  is  made — for  this  purpose,  reviews  such  as  the  one  by  Asmail  (1991)  should  be 
consulted.  It  is  logical  to  begin  with  the  reflectometers  used  to  measure  BRDFs  in  the  laboratory. 
Typical  examples  are  described  by  Roche  and  Pelletier  (1984),  Zaworski  etal.  (1993),  and 
Sandmeier  et  al.  (1997).  At  their  hearts  are  a  goniometer  which  permits  one  to  choose  values  of 
an^or  in  certain  ranges,  to  within  specified  angular  resolutions  and  source  and 
detector  solid  angles.  Two  common  design  geometries  are:  type  (i)  where  the  source  and  detector 
are  each  mounted  on  a  pair  of  arc  rails  so  that  they  can  be  scanned  throughout  the  hemisphere 
above  a  stationary  sample;  and  type  (ii)  for  which  the  source  is  fixed  in  place,  the  sample  rotates 
about  the  x  and  y  axes  defining  its  surface,  and  the  detector  orbits  around  the  sample  in  a  single 
plane.  Various  choices  for  the  collimation  or  focusing  of  the  incident  light  are  discussed  in 
Nicodemus  et  al.  (1977).  Sometimes  the  sample  is  rapidly  spun  about  its  normal  direction  to 
average  out  inhomogeneities  such  as  laser  speckle,  and  the  incident  beam  chopped  so  that  lock-in 
detection  can  be  used  to  eliminate  scattering  by  ambient  light.  Oppenheim  et  al  (1994)  have 
made  measurements  on  various  recommended  standards  of  diffuse  (nearly  Lambertian) 
reflectance  in  the  IR  used  in  calibrating  such  instruments. 

Moving  on  to  applications  in  the  domains  of  interest  outlined  in  the  introduction,  Snyder  and 
Wan  (1996)  have  used  a  Fourier-transform  IR  spectrometer  mounted  on  a  type  (i)  goniometer  to 
investigate  soil  samples.  It  proved  necessary  to  make  measurements  both  with  the  source  on  and 
off,  in  order  to  correct  for  sample  heating  by  it.  The  ultimate  goal  is  to  determine  the  BRDF  for 
analysis  of  satellite-based  thermometric  imaging  of  earth’s  surface.  Stavridi  etal  (1997)  have 
exarnined  brick,  tile,  and  concrete  with  a  CCD  camera  in  a  type  (ii)  geometry  for  architectural 
applications.  As  is  typical  of  many  experimental  papers,  these  researchers  have  fitted  their  data 
by  combining  several  models  discussed  in  the  previous  section.  Specifically,  they  chose  a  linear 
combination  of  the  Torrance-Sparrow  and  Oren-Nayar  facet  models.  There  is  no  theoretical 
justification  for  such  ad  hoc  combinations,  and  so  they  are  probably  best  viewed  as  empirical 
fitting  functions  which  work  because  they  contain  a  sufficient  number  of  free  parameters.  Newell 
and  Keski-Kuha  (1996)  have  looked  at  extreme-ultraviolet  (UV)  scatterers  and  baffles  of  interest 
to  astronomers.  Perhaps  not  surprisingly,  materiids  which  perform  well  in  the  IR  and  visible 
spectral  ranges  often  make  poor  choices  for  the  UV.  Similarly,  Watkins  et  al  (1993)  made  s-  and 
p-polarized  measurenients  of  high-reflectance  dielectric  coatings  using  laser  sources  and  an 
InGaAs  or  photomultiplier  detector  in  a  type  (ii)  configuration.  Superpolishing  of  the  substrates 
can  reduce  the  BRDF,  but  generally  only  at  the  design  wavelength — scatter  characteristics  at 
different  wavelengths  tend  to  be  uncorrelated.  Bickel  et  al  (1987;  also  see  lafelice  and  Bickel, 
1987)  have  measured  the  complete  Mueller  matrices  for  smooth  and  distorted  metal  surfaces  by 
periodically  modulating  the  incident  polarization  state  from  a  laser  and  measuring  the 
fundamental  and  second  haimonic  of  the  scattered  light  in  a  type  (ii)  setup;  this  demonstrates  the 
sensitivity  of  polarimetric  optical  scattering  to  surface  perturbations.  An  alternative  technique  for 
finding  the  Mueller  matrix  uses  a  fixed  linear  polarizer  and  a  variable  retarder  in  both  the  source 
and  receiver  arms  (Somsin  and  Chipman,  1996);  an  integrating  sphere  scrambles  the  analyzed 
state  to  compensate  for  any  polarization  sensitivity  of  the  detector.  Burnell  et  al  (1994)  have 
studied  oxidized  nickel  collected  from  the  interior  of  furnace  tubes  for  the  purposes  of 
distinguishing  emitted  and  reflected  radiation  in  pyrometric  measurements.  After  finding  that  the 
data  could  not  be  fit  with  physically  realistic  values  of  the  parameters  in  the  Torrance-Sparrow 
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model,  they  also  modified  it  in  an  ad  hoc  fashion  to  give  a  function  which  worked  better  but 
could  not  be  interpreted  physically.  This  is  reminiscent  of  the  approach  used  to  devise  the 
coupled  model  (Shirley  et  al,  1997)  or  the  (similar)  modified  Beard-Maxwell  model  (Ellis, 
1996a)  for  glossy  surfaces. 
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Abstract 


The  evolution  of  structure  and  mechanical  properties  with  various  heat  treatment  conditions 
was  investigated  in  this  study  for  pitch-based  carbon  fibers.  AR-mesophase  and  SCF- 
mesophase  were  obtained  in  the  form  of  pitch  fibers.  The  fibers  were  first  stabilized  to 
render  them  infusible  and  then  subjected  to  heat  treatment  temperatures  (HTT)  that  ranged 
ffom300°Cto3000°C. 

^e  imique  aspect  of  the  present  study  was  the  characterization  of  dimensional  changes  of 
individual  fibers  in  terms  of  their  length  and  diameter.  The  dimensional  measurements  for 
AR  fibers  indicate  very  clearly  that  starting  from  an  oxidized  state,  the  length  of  the  fibers 
shrinks  about  8%  at  a  HTT  of  900°C.  Above  900°C,  the  length  does  not  reduce  any 
further,  instead  it  increases  slightly.  The  final  shrinkage  of  the  length  (relative  to  the 
oxidized  state)  is  about  6%.  The  slight  increase  can  be  explained  by  the  alignment  of  the 
graphene  layer  planes  along  the  fiber  axis  that  result  in  the  shrinkage  of  fibers  in  the 
transverse  direction  (diameter)  but  an  expansion  along  the  longitudinal  direction. 
Considerably  larger  changes  are  observed  in  the  diameter  of  the  fibers.  Starting  from  the 
oxidized  state,  a  significant  drop  of  15%  is  observed  till  900°C  HTT.  From  9(X)°C  till 
1500°C,  there  appears  to  be  a  slight  increase  in  the  diameter.  Beyond  1500°C,  there  is 
again  a  significant  reduction  of  the  diameter  with  the  ultimate  shrinkage  being  about  20%  at 
HTT  approaching  30(X)°C. 

The  fiber  microstmcture  was  characterized  by  scanning  electron  microscopy  (SEM). 
Consistent  with  the  dimensional  measurements,  the  SEM  examinations  reveal  that  the  first 
major  change  is  observed  at  900°C,  where  a  radial  texture  is  observed.  At  higher  HTT,  the 
development  of  radial  texture  is  more  pronounced  and  by  2400°C  the  graphene-layer  planes 
are  seen  very  clearly.  Wide-angle  x-ray  scattering  (WAXS)  measurements  indicate  that  at 
2875°C  HTT,  the  fibers  have  a  d(x)2  spacing  of  0.3354  nm  indicating  that  the  material  is 
almost  fully  graphitic.  Single-filament  testing  indicates  that  the  tensile  modulus  and 
strength  increase  with  increasing  HTT  till  2100°C,  where  a  slight  drop  is  observed 
followed  by  further  increase  till  the  maximum  HTT  of  2875°C.  The  maximum  modulus 
was  measured  to  be  about  385  GPa  whereas  the  maximum  strength  was  found  to  be  about 
1.8  GPa. 

The  present  study  establishes  that  in  AR  mesophase  pitches,  the  first  significant  change  in 
dimensions  and  microstructure  occurs  at  about  900°C.  Whereas  microstructure  and 
properties  continue  to  evolve  over  higher  heat  treatment  temperatures,  the  next  significant 
change  is  observed  above  2400°C.  These  results  should  help  in  the  establishment  of 
optimum  processing  conditions  for  carbon  fiber/carbon  matrix  composites. 
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Introduction 


Carbon  fibers/carbon  composites  have  unique  thermal  and  mechanical 
properties  [1-3].  However,  their  high  processing  costs  have  limited  their 
use  to  such  applications  where  performance  is  the  critical  controlling  factor 
[4].  The  high  costs  have  prevented  their  use  in  a  number  of  military  and 
civilian  applications. 

One  approach  to  reducing  processing  costs  is  the  simultaneous 
carbonization  of  the  “green”  fibers  and  the  “green”  matrix.  This  approach 
eliminates  one  expensive  carbonization  step,  because  this  approach 
combines  two  carbonization  steps  into  one.  The  science  of  simultaneous 
carbonization  of  fibers  and  matrix,  however,  is  not  well  understood. 

The  processing  and  characterization  of  composites  has  been  a  subject  of 
great  interest.  Several  techniques  have  been  proposed  to  monitor  the 
strucmre  and  the  stresses  generated  during  processing  due  to  a  mismatch  of 
thermal  expansion  coefficients  of  the  various  phases  as  well  as  the  chemical 
changes  taking  place  in  the  fiber/matrix.  In  carbon-carbon  composites,  the 
stresses  are  primarily  caused  by  the  shrinkage  of  the  precursor  material 
when  noncarbon  constituents  (oxygen,  nitrogen,  CO,  C02)  are  evolved 
during  carbonization  and  further  heat  treatment  steps.  This  shrinkage 
results  in  matrix/fiber  cracks  and  leads  to  deterioration  of  physical  and 
mechanical  properties. 

The  microstructure  and  properties  of  pitch-based  fibers  have  been 
investigated  by  a  number  of  researchers  [5-10].  However,  only  a  limited 
number  of  studies  have  dealt  with  dimensional  changes  [1 1,12]. 


Objectives 

To  optimize  processing  conditions,  it  is  important  to  develop  a  clear 
understanding  of  the  dimensional  and  microstructural  changes  taking  place 
in  the  mesophase/precursor  during  the  various  heat  treatment  steps. 
Therefore,  this  study  was  directed  at  the  characterization  of  the  evolution 
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of  microstructure  in  pitch-based  fibers  for  various  heat  treatment 
conditions.  Specific  objectives  were: 

(i)  to  quantitatively  measure  the  dimensional  changes  in  the  fiber; 

(ii)  to  characterize  the  fiber;  and 

(iii)  measure  the  tensile  properties  of  the  fiber  to  obtain  a  relationship 
between  dimensional  changes,  microstructure  evolution,  and 
mechanical  properties. 


Experimental 

Materials 

All  of  the  studies  were  conducted  on  pitch-based  carbon  fibers.  Two 
grades  of  pitches  were  investigated:  AR-mesophase,  provided  by  Mitsubishi 
Gas  Chemical  Company,  Japan,  and  mesophase  fraction  separated  by 
supercritical  fluid  (SCF)  extraction.  The  SCF  precursor  was  provided 
courtesy  of  Prof.  Mark  Thies,  and  the  pitch  fibers  were  spun  courtesy  of 
Prof.  Dan  Edie,  both  at  Clemson  University. 

Processing 

Stabilization 

The  as-spun  pitch  fibers  were  first  stabilized  by  an  air-oxidation  process. 

In  this  process,  the  fibers  are  heated  to  a  temperature  below  their  softening 
point  in  the  presence  of  air.  Whereas  thin  (10  micrometer  diameter)  AR 
pitch  fibers  can  be  stabilized  in  a  temperature  range  of  180°C  to  280°C 
within  a  few  hours,  the  fibers  in  the  present  study,  about  50-100  um,  had  to 
be  oxidized  for  about  3  days  at  200°C.  The  thicker  fibers  were  needed  in 
the  present  study  to  facilitate  the  accurate  measurement  of  length  and 
diameter  for  the  dimensional  study  (  described  in  the  next  subsection).  The 
longer  stabilization  time  is  consistent  with  the  results  of  Singer  and 
Mitchell  [13]  who  showed  that  the  stabilization  process  is  controlled  by  the 
diffusion  of  oxygen  in  the  thick  fibers,  and  that  the  stabilization  time 
required  for  100  um  diameter  fibers  is  more  than  an  order  of  magnitude 
larger  than  that  required  for  10  um  fibers.  The  SCF  fibers  were  stabilized 
in  two  steps:  12  hrs  at  180°C  followed  by  a  10°C/min  heating  to  280°C,  a 
holding  time  of  approximately  17  hours,  and  a  cooldown  at  10°C/min.  At 
the  end  of  the  stabilization  cycle,  the  infusible  state  of  the  fibers  was 
confirmed  by  contacting  the  fibers  with  a  soldering  iron  heated  to  400°C. 
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Carbonization/Graphitization 


The  stabilized  fibers  were  heated  to  a  final  temperature  (Tmax)  ranging 
from  300°C  to  3000°C  in  increments  of  300°C  (10  sets)  at  a  nominal 
heating  rate  of  3°C/min.  Heat  treatment  temperamres  up  to  900°C  were 
attained  in  a  Mellen  furnace  (Webster,  New  Hampshire),  whereas  those 
above  900°C  were  attained  in  a  graphitizing  furnace  (Materials  Research 
Furnace  Inc.,  NH). 

Characterization 

Dimensional 

The  first  approach  that  was  explored  was  the  in  situ  measurement  of 
length  changes  taking  place  in  a  single  fiber  during  heat  treatment.  A 
preliminary  design  was  conceived  for  a  fixture  that  could  be  mounted  in  a 
microbalance.  One  end  of  the  fiber  would  be  attached  to  the  sapphire  rod 
of  the  instrument,  whereas  the  other  end  would  be  held  rigidly  in  the 
carbon/carbon  composite  fixture.  However,  the  major  problem  with  the 
fixture  was  its  inability  to  grip  the  fiber  ends  without  crushing  them. 
Modifications  to  the  fixture  would  have  required  considerable  machining 
effort.  Since  a  machine  shop  was  not  available  at  the  AFRL  Labs,  these 
modifications  will  be  attempted  in  a  follow-up  project  at  Clemson 
University.  However,  another  approach  was  adopted  that  was  ex  situ.  The 
dimensions  were  measured  by  a  Nikon  Microphot-FXL  optical  microscope, 
both  before  and  after  the  heat  treatment  cycle.  The  fibers  were  placed 
individually  on  glass  slides  and  the  length  and  diameter  were  measured 
with  the  aid  of  a  Microcode  indexer  with  a  resolution  of  1  micrometer. 

For  heat  treatment,  the  fibers  were  placed  on  a  graphite  plate  (approx.  2  ft 
diameter)  and  the  position  of  each  fiber  was  carefully  noted.  After  heat 
treatment  in  the  graphitization  furnace,  the  fibers  were  carefully  removed 
from  the  graphite  plate  and  placed  on  the  glass  slides. 

Microstructural 

The  microstructure  was  characterized  by  a  scanning  electron  microscope 
(Personal  SEM,  RJ  Lee  Instruments,  Trafford,  PA).  High  magnification 
images  were  obtained  for  a  limited  number  of  samples  using  a  Hitachi  S900 
high  resolution  microscope.  The  crystalline  content  and  structure  (d002 
spacing,  Lc,  and  La)  of  fibers  processed  at  2875°C  were  measured  by  wide 
angle  X-ray  scattering  (WAXS)  using  a  Huber  Instrument. 
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Tensile  Properties 

The  tensile  modulus,  strength,  and  strain-to-failure  of  fibers  were 
measured  by  conducting  single-fiber  tests  in  a  Sintech  tensile  testing 
machine.  The  gage  length  was  25.4  mm  and  the  extension  rate  was  0.0508 
mm/min.  To  obtain  statistically  significant  values,  the  number  of  samples 
required  for  single-fiber  testing  is  fairly  large,  often  exceeding  20 
replicates  for  a  given  experimental  condition.  Since  time  constraints  did 
not  allow  for  the  testing  of  such  large  number  of  samples  during  this 
summer  project,  about  5  samples  were  tested  for  each  condition.  The 
larger  set  will  be  tested  in  a  follow-up  project  at  Clemson  University. 

Results  and  Discussion 

Dimensional  Changes 

Since  carbonization  involves  the  removal  of  noncarbon  species  from  the 
organic  precursors,  significant  dimensional  changes  (typically  shrinkage) 
take  place  during  processing.  One  of  the  primary  objectives  of  the  present 
study  was  the  careful  measurement  of  such  changes  accompanying  various 
heat  treatment  conditions. 

Figure  1  displays  a  calibration  plot  of  the  readings  from  the  Microcode 
indexer  plotted  against  the  markings  from  a  Bosch  &  Lomb  calibration 
slide.  The  Microcode  measurement  were  accurate  to  within  ±1  um. 

Next,  Table  1  displays  the  length  measurements  for  four  different 
stabilized  AR  fibers.  The  fiber  lengths  used  in  the  present  study  ranged 
from  about  10  mm  up  to  about  40  mm.  Different  lengths  were  purposely 
used  to  enable  unambiguous  identification  of  the  fibers  as  they  were  placed 
on  and  removed  from  the  graphite  plate.  For  a  given  conformation  of  the 
fiber,  the  length  could  be  measured  consistently  with  an  error  of  no  more 
than  ±10  um.  The  diameter  measurements  could  be  performed  consistently 
to  within  ±1  um.  Repeated  measurements  of  fiber  lengths  (after  removing 
and  replacing  the  fibers  on  the  glass  slide)  have  an  error  of  no  more  than 
0.2%,  with  much  of  the  error  arising  from  the  waviness  of  fibers. 


After  heat  treatment,  about  half  of  the  AR  fibers  remained  fairly  straight, 
whereas  the  other  half  curled  slightly.  In  most  cases  the  curvature  was 
small  and  the  length  could  be  accurately  determined  by  adding  the  length  of 
5  to  6  individual  segments.  For  SCF  fibers,  the  extent  of  curling  was 
significant,  the  reason  being  their  diameter  was  only  about  30-50  um  as 
compared  with  50-100  um  for  AR  fibers.  The  ratio  of  the  fiber  length 
before  and  after  heat  treatment  is  reported  as  “L/Lo”. 
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Figure  2  displays  the  length  ratio  as  a  function  of  the  heat  treatment 
temperatures  for  AR  and  SCF  fibers.  As  stated  above,  the  SCF  fiber  tended 
to  curl  up  during  the  heat  treatment  and  as  a  result  the  length  measurement 
had  considerable  variations;  the  changes  in  the  length  ratio  form  one 
temperature  to  the  next  are  statistically  not  significant  at  95%  confidence 
level.  Therefore,  the  results  should  be  treated  as  semiquantitative  trends. 
However,  for  AR  fibers  the  measurements  were  quite  accurate.  Starting 
from  the  stabilized  state,  the  length  of  the  fibers  shrinks  about  5%  at  a  HTT 
of  600°C  and  about  8%  at  a  HTT  of  900°C.  From  900°C  to  1200°C,  there 
appears  to  be  a  slight  increase  in  the  length,  but  this  difference  is  not 
statistically  significant  at  the  95%  confidence  level.  A  summary  of  the 
statistics  is  presented  in  Table  2.  Beyond  1200°C,  the  length  does  not 
shrink,  instead  it  increases  slightly.  The  increase  from  1200°C  to  1500°C 
is  statistically  significant.  The  changes  from  1500°C  to  1800°C  and  1800°C 
to  2400°C  are  statistically  not  significant.  However,  at  the  highest 
processing  temperamres,  the  increase  in  the  length  ratios  is  statistically 
significant.  Thus,  at  the  highest  heat  treatment  temperature  of  2875°C,  the 
fibers  display  a  shrinkage  of  about  6%  as  compared  with  8%  at  the 
intermediate  temperature  of  900°C. 

Considerably  larger  changes  are  observed  in  the  diameter  of  the  fibers,  as 
displayed  in  Figure  3.  The  diameter  measurements  for  the  SCF  fibers 
were  more  consistent  than  their  length  counterpcuts.  However,  because  the 
SCF  fibers  were  thinner  than  the  AR  fibers,  the  SCF  measurements  tend  to 
have  more  error  than  do  their  AR  counterparts.  To  verify  if  the  large 
shrinkage  in  the  diameter  relative  to  that  along  the  length  was  due  to 
surface  oxidation  (from  fugitive  oxygen)  during  heat  treatment,  the 
diameter  ratio  was  plotted  as  a  function  of  the  reciprocal  diameter.  For 
increasing  diameter,  the  extent  of  surface  oxidation  decreases.  Thus,,  in 
the  limiting  case  (as  D  becomes  large  and  the  reciprocal  of  D  approaches 
zero),  the  intercept  on  the  ordinate  could  provide  a  measure  of  the 
shrinkage  in  the  absence  of  oxidative  loss.  Representative  plots  for  600, 
1800,  2400,  and  2700°C  are  presented  in  Figure  4.  However,  the  diameter 
ratio  was  not  found  to  have  any  dependence  on  the  reciprocal  diameter. 
Therefore,  the  role  of  surface  oxidation  was  not  significant  and  the  values 
reported  in  Figure  3  for  the  diameter  ratio  were  the  average  values  (from 
various  do)  for  any  given  HTT. 

Starting  from  the  oxidized  state,  a  significant  drop  of  15%  is  observed  in 
the  diameter  ratio  till  a  HTT  of  900°C.  From  900°C  to  1200°C,  there  is 
an  increase  in  the  diameter  ratio,  but  the  increase  from  1200°C  to  1500°C 
is  statistically  not  significant  for  either  of  AR  or  SCF  fibers.  The  large 


23-7 


drop  in  the  diameter  ratio  at  1800°C  is  statistically  significant  but  cannot  be 
explained  by  a  physical  explanation.  Beyond  1800°C,  there  is  a  slight 
increase  of  the  diameter  for  both  AR  and  SCF  fibers  with  the  ultimate 
shrinkage  being  about  20%  for  the  AR  fibers  and  30%  for  the  SCF  fibers 
at  the  highest  HTT  of  2875°C. 

Microstructural  Characterization 

Scanning  electron  microscopy  (SEM)  was  used  to  characterize  the 
microstructure  of  the  fibers  at  the  various  HTTs.  The  micrographs  depict 
typical  lateral  surfaces  of  the  fibers  as  well  as  the  cross-sections  of  fibers 
failed  in  tension.  Because  of  space  constraints,  it  is  not  possible  to  include 
all  of  the  micrographs  (about  30)  in  this  report;  only  those  at  3  conditions 
will  be  presented  (as-spun,  900°C,  and  2875°C). 

Figure  5  displays  a  micrograph  of  an  as-spun  AR  pitch  fiber.  The 
diameter  of  the  fiber  is  about  80  pm  and  the  cross-section  as  well  as  the 
lateral  surface  of  the  fiber  are  devoid  of  any  texture.  The  fractured 
surface  of  the  fiber  is  typical  of  a  brittle  failure.  The  presence  of  bubbles 
arid  voids  generated  by  the  off-gassing  of  the  AR  mesophase  is  evident  in 
this  micrograph  as  well  as  others.  For  a  stabilized  AR  fiber,  the  cross- 
section  is  again  typical  of  a  brittle  failure.  For  smaller  stabilized  fiber 
(about  40  pm  diameter),  some  radial  texture  can  be  observed.  The  origin 
of  the  texture  is  very  likely  the  radial  orientation  of  the  discotic  AR 
mesophase  molecules  generated  during  fiber  spinning.  Also  longitudinal 
cracks  result  from  weaker  properties  of  the  material  in  the  hoop  direction. 

After  heat  treatment  at  300°C,  there  is  virtually  no  additional  texture 
developed.  In  fibers  treated  to  600°C,  the  presence  of  a  slight  radial 
texture  can  be  detected,  but  the  lateral  fiber  surface  is  still  fairly  smooth. 

The  first  clear  evidence  of  a  radial  texture  is  observed  at  a  HTT  of  900°C, 
as  illustrated  in  Fig.  6.  The  splitting  of  the  fibers  in  a  “Pac-man”  pattern  is 
typical  of  the  anisotropic  pitch-based  fibers.  The  alignment  of  the  discotic 
mesophase  molecules  in  the  radial  direction  results  in  the  material  being 
fairly  weak  in  the  hoop  direction,  and  the  hoop  stresses  generated  during 
the  heat  treatment  are  relieved  by  the  “Pac-man”  split.  Surface  striations 
are  also  observed  for  the  first  time  at  900.  At  1200°C,  the  existence  of  a 
lamellar  structure  has  started  to  emerge,  and  at  1500°C,  the  texture  is 
slightly  more  developed  as  compared  to  that  at  1200°C.  At  higher  HTTs  of 
1800  and  2100°C  (figures  17,  18,  and  19),  there  is  a  gradual  sharpening  of 
the  domains  or  the  texture. 


23-8 


At  2400°C,  the  presence  of  graphene-layer  planes  is  very  clear.  The 
microdomains  (consisting  of  graphene  layers)  can  be  easily  seen  as  can  the 
folding  of  the  graphene  planes.  At  2700°C,  there  is  further  refinement  of 
the  structure.  However,  the  presence  of  some  unusual  fibers  was  also 
detected.  The  fibers  appear  to  have  a  hollow  core,  although  the  length  of 
the  core  could  not  be  determined.  In  these  fiber  too,  the  graphene  planes 
are  well-developed  and  aligned  normal  to  the  inside  surface  of  the  hollow 
core.  At  the  highest  HTT  that  could  be  attained  in  the  present  study, 
2875°C,  the  microdomains  consisting  of  sheet-like  graphene  planes  are 
very  clearly  seen  in  Fig.  7.  Some  unusual  features  were  also  identified 
such  as  the  presence  of  a  peeled  skin  and  a  hollow  core. 

SEM  investigation  was  also  conducted  on  SCF  fibers.  The  observation  are 
similar  to  those  for  the  AR  fibers.  As-spun  SCF  fibers  are  smooth  and 
have  no  texture.  Stabilized  fibers  too  are  fairly  devoid  of  texture,  although 
in  some  fibers  a  very  slight  radial  texture  may  be  observed.  At  a  HTT  of 
300°C,  a  slight  radial  texture  may  be  observed  in  some  fibers;  also  splitting 
is  seen.  At  a  HTT  of  600°C,  some  radial  texture  is  observed  in  the  skin, 
but  not  in  the  core,  but  the  lateral  surface  is  still  smooth. 

The  emergence  of  lamellar  graphene  planes  is  apparent  at  900°C  as  is  the 
presence  of  surface  striations.  At  a  HTT  of  1200°C,  the  existence  of  a 
radial  structure  is  clear,  and  by  1500°C  the  radial  structure  is  observed 
throughout  the  cross-section  of  small  fibers,  but  not  in  a  thicker.  At 
1800°C,  the  layered  structure  of  graphene  planes  is  apparent.  At  2400°C, 
2700°C,  and  2875 °C,  the  presence  of  microdomains  with  graphene  planes  is 
very  clear.  The  unusual  hollow  core  observed  for  AR  is  also  observed  for 
some  SCF  fibers. 

High  resolution  SEM  was  also  used  for  selected  samples  to  obtain  details  of 
the  microstructure.  For  fibers  heat  treated  to  2875 °C,  the  well  developed 
graphene  planes  are  evident.  Images  could  be  obtained  at  magnifications  as 
high  as  50,000X  and  100,000X.  The  high  resolution  images  were  also 
obtained  at  fibers  carbonized  at  900°C.  These  micrographs  illustrate  that 
even  at  the  low  processing  temperature  of  900°C,  graphene-like  planes 
exist.  These  planes  appear  to  be  surrounded  by  a  continuous  matrix  like 
material,  in  contrast  to  the  fibers  treated  at  2875 °C,  where  the  graphene 
layer  appear  to  be  present  discretely. 

The  microstructure  of  the  fibers  heat  treated  to  2875 °C  was  also 
characterized  by  wide  angle  x-ray  scattering  (WAXS).  Only  one  sample 
could  be  investigated  because  the  control  and  data-acquisition  unit  of  the 
XRD  instrument  was  being  repaired.  The  data  from  the  2-theta  and 
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azimuthal  scans  conducted  in  the  symmetrical  transmission  mode  indicate 
that  the  doo2  spacing  for  the  fibers  heat  treated  at  2875°C  was  0.3354  nm. 
This  indicates  that  the  material  is  almost  fully  graphitic  since  the  limiting 
value  of  doo2  spacing  is  reported  as  0.3354  nm.  The  stacking  height,  L^, 
was  determined  to  be  about  36  nm,  whereas  the  was  found  to  be  about 
48  nm.  The  presence  of  a  (103)  peak  confirmed  the  existence  of  3- 
dimensional  crystallinity  (as  opposed  to  turbostratic  structure). 

Tensile  Properties 

The  tensile  properties  were  measured  by  single-filament  testing.  To  obtain 
statistically  significant  conclusions,  the  number  of  replicates  required  in 
single-filament  tests  often  exceed  20.  However,  due  to  time  constraints, 
only  5  samples  were  tested  for  each  HTT  condition.  Also,  the  tensile 
fixture  was  slightly  damaged  and  as  a  result  many  samples  broke  while 
cutting  the  paper  tab.  In  addition,  the  SCF  fibers  were  significantly  wavy 
after  heat  treatment  above  200°C  and  broke  in  the  paper  tab  while  being 
straightened.  The  limited  tensile  strength  and  modulus  data  for  SCF  fibers 
are  displayed  in  Figure  8.  The  differences  in  values  at  various  HTTs  are 
statistically  not  significant.  Therefore,  only  a  qualitative  trend  can  be 
inferred  from  the  graph  that  the  strength  and  modulus  increase  from  the 
lowest  HTT  of  600°C  to  the  highest  HTT  of  2875°C. 

The  tensile  strength  and  modulus  data  for  AR  fibers  are  displayed  in 
Figure  9.  Starting  from  the  lowest  HTT  of  600°C,  the  modulus  increases 
for  higher  HTTs  of  900  and  1200°C;  the  increases  are  statistically 
significant  at  the  95%  confidence  level.  The  moduli  increase  from  a  HTT 
of  1200  to  1500°C  is  statistically  not  significant,  but  those  from  1500  to 
1800°C  and  from  1800  to  2100°C  are.  From  2100°C  to  2400°C,  the 
modulus  decreases  and  the  difference  is  statistically  significant.  The  change 
in  modulus  from  2400  to  2700°C  is  not  significant,  but  the  modulus 
increase  from  2700  to  2875°C  is  statistically  significant.  The  maximum 
modulus  was  measured  to  be  55  Msi  (380  GPa). 

The  tensile  strength  data  for  AR  fibers  follow  a  similar  trend  as  that 
displayed  by  the  modulus  data,  although  the  peak  appeeirs  slightly  earlier  at 
1800°C.  From  1800°C  to  2400°C,  there  is  a  slight  decrease  in  tensile 
strength,  but  beyond  2400°C,  the  strength  increases,  and  a  maximum 
strength  of  253  ksi  (1.8  GPa)  is  observed  at  the  highest  HTT  of  2875°C. 

The  strain-to-failure  data  for  AR  and  SCF  fibers  are  presented  in  Figure 
10.  Because  of  the  experimental  error  associated  with  these  measurements, 
most  of  the  differences  are  statistically  not  significant. 
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The  strength  and  modulus  values  are  somewhat  lower  than  the  values  of  2- 
3  Gpa  and  300-800  GPa  reported  in  the  studies  of  Mochida  et  al  [7]  and 
Pencock,  et  al.  [8].  The  primary  reason  for  the  lower  values  is  that  the 
fibers  chosen  for  this  study  were  considerably  thicker  (ranging  from  30 
micrometers  to  100  micrometers)  than  those  used  in  most  other  studies. 
Also,  as  was  illustrated  in  the  SEM  micrographs,  the  thick  fibers  tend  to 
have  more  defects  in  the  form  of  gas-bubbles,  cracks,  and  the 
characteristics  “Pac-man”  splitting.  However,  it  should  be  recalled  that  the 
thicker  fibers  facilitated  accurate  measurement  of  the  dimensional  changes 
and  were  purposely  chosen  for  the  present  study  where  the  primary 
emphasis  was  characterization  of  the  dimensional  changes  accompanying 
heat  treatment. 


Conclusions 

The  following  conclusions  are  drawn  from  this  study: 

1.  Changes  in  fiber  length  can  be  reliably  measured  for  fibers  that  have 
diameters  in  the  range  50  to  100  um. 

2.  For  the  AR  mesophase  pitch  fibers,  the  maximum  length  shrinkage  of 
8%  occurs  at  a  heat  treatment  temperature  of  about  900°C.  For  the 
highest  HTT  of  2875 °C,  the  total  shrinkage  in  length  was  only  6%.  The 
slight  increase  from  900  to  2875°C  may  result  from  the  alignment  of  the 
graphene  planes  from  the  initial  out-of-plane  orientation  to  the  very 
highly  aligned  state  at  2875°C.  The  shrinkage  in  diameter  is  about  20% 
at  the  highest  HTT  of  2875 °C,  corresponding  to  an  area  reduction  of 
almost  36%  in  the  transverse  plane.  These  values  clearly  establish  the 
anisotropic  nature  of  the  dimensional  changes  during  carbonization. 

3.  High  resolution  scanning  electron  microscopy  images  indicate  that 
significant  development  of  graphene-like  layers  begins  at  about  900°C 
and  continues  till  about  2400°C.  XRD  results  indicate  that  the  AR 
material  heat  treated  to  2875°C  has  a  doo2  spacing  of  0.3354  nm  and  is 
almost  fully  graphitic. 
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Future  Work 


The  most  significant  aspect  of  the  present  study  was  the  characterization  of 
dimensional  changes  together  with  supporting  evidence  from  SEM,  XRD, 
and  tensile  measurements.  Whereas  the  dimensional  measurements  could 
be  successfully  conducted  ex  situ,  the  measurement  of  dimensional  changes 
need  to  be  measured  in  situ  to  obtain  an  estimate  of  the  changes  at  the 
actual  elevated  processing  temperatures.  A  modified  carbon/carbon  fixture 
suitable  for  mounting  in  a  TGA  apparatus  needs  to  be  developed  as  a  part 
of  the  future  work.  Also,  XRD  data  and  tensile  data  need  to  be  generated 
for  the  AR  fibers.  Finally,  the  methodology  developed  in  this  project 
should  be  verified  on  SCF  and  other  fibers. 
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Table  1 .  Replicate  length  values,  mm,  of  stabilized  AR  fibers 


Sample  #  -U  AR  Stabil  Average  LI  :Std  Dev  L1 

RepI  Avg  L2 

StdDevL2  L1/L2 

!  ■  I 

1  :  28.443‘  ■ 

'  28.462  28.452  0.0096 

28.462 

0.0031  0.999 

28.451- 

2  31.024; 

31.041:  31.041 

0.0174 

31.08 

0.0041  0.998 

31.059 

- ^ 

3:  17.337 

1  17.327 

17.334  0.0061 

17.357 

0,0093  0.998 

i  17.338 

4;  17.708 

1 

;  17.74 

17.728!  0.017 

17.76 

0.0059  0.998 

i  17.736 

' 

Table  2.  Comparison  of  length  values  for  as-spun  and  stabilized  AR  fibers. 


1  Sample 

L  AR  As-Spun  ■  Std  Dev  L  AsSf  Lstab/Lassp  1 

1  ^  1 

_ 

1 

28.464  0.0108 

0.9999 

_ : _  ; 

_ 

2 

31.06  0.0081 

1.00061 

_ _  ^ 

3 

17.3227.  0.0106 

I.OO2I 

_ ^ . . ^ 

1 _ 

4 

17.71i;  0.0148 

1.00271 
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Figure  1.  A  calibration  curve  for  the  Microcode  Indexer  Vs  Readings  from  the  Bosch  & 
Laumb  calibration  microslide. 
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Figure  2.  Length  ratio  after  heat  treatment  as  a  function  of  the  heat  treatment  temperature. 
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Figure  3.  Fiber  diamter  ratio  as  a  function  of  heat  treatment  temperatures. 
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Figure  4.  D/Do  ratio  plotted  as  a  function  of  reciprocal  diameters 


Figure  5.  SEM  micrograph  of  as-spun  AR  pitch  fiber  at  a  magnification  of  500X. 


Figure  6.  SEMmicrograph  of  a  AR  fiber  heat  treated  at  900°C  at  a  magnification  of  lOOOX 


strength, 


Figure  7.  SEM  micrograph  of  AR  fiber  heat  treated  to  2875°C  at  a  magnification  of  lOOOX 
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Figure  8.  Strength  and  Modulus  of  heat  treated  AR  fibers. 
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Figure  9.  Strength  and  Modulus  of  heat  treated  SCF  carbon  fibers. 
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Figure  10.  Strain  to  Failure  of  heat  treated  AR  and  SCF  carbon  fibers. 
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Abstract 


A  computer  simulation  to  determine  the  field-pattern  of  arbitrarily  oriented  very  large  phase/time  delay 
scanned  antennas  was  developed.  The  simulation  takes  into  account  errors  present  in  antenna  arrays.  These  are 
systematic  and  random  errors.  The  systematic  errors  considered  here  are  the  finite  quantization  of  the  phase 
produced  by  the  use  of  digital  N-bits  phase/time-delay  shifters  and  the  flexing  of  the  array  aperture  due  to  its  large 
size  and  weight.  The  random  errors  considered  are  those  caused  by  variations  on  the  amplitude  and  phase  of  the 
elements  current,  variations  on  the  radiation  pattern  of  the  elements,  and  missing  elements  (due  to  catastrophic 
failure),  and  variations  in  the  location  of  the  elements.  To  validate  the  simulation  a  number  of  patterns  were 
computed.  These  included  patterns  of  linear  arrays,  array  panels,  and  arrays  of  panels.  Experience  as  well  as 
specific  examples  validated  the  ideal  patterns.  The  “random-error-pattems”  were  compared  to  specific  trends  noted 
in  earlier  studies.  The  behavior  of  the  computed  patterns  confirmed  such  trends. 
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Simulation  of  the  Antenna  Pattern  of  Arbitrarily  Oriented 
Very  Large  Phase/Time-Delay  Scanned  Antenna  Arrays 
With  Systematic  and  Random  Errors 

Carlos  R.  Ortiz,  Ph.D. 


1.0  Introduction: 

The  purpose  of  this  effort  was  to  write  a  simulation  to  compute  the  field  of  arbitrarily  oriented,  very  large 
phase/time-delay  scanned  antenna  arrays  with  systematic  and  random  errors.  As  explained  in  [1],  errors  in  antenna 
arrays  can  be  divided  into  two  categories  depending  on  whether  they  are  predictable  (systematic)  or  random. 
Among  systematic  errors  is  the  finite  quantization  of  the  phase  produced  by  the  use  of  digital  N-bits  phase/time- 
delay  shifter.  Another  error  considered  here  is  the  flexing  of  the  array  aperture  due  to  its  large  size  and  weight. 
Random  errors  are  caused  by  variations  on  the  amplitude  and  phase  of  the  elements  current,  variations  on  the 
radiation  pattern  of  the  elements,  missing  elements  (due  to  catastrophic  failure),  and  others  not  considered  here. 
Random  errors  may  alter  antenna  parameters  such  as,  increasing  sidelobe  level,  reducing  power  gain,  lowering 
directivity,  etc. 

2.0  Theory: 

The  development  of  the  necessary  equations  to  simulate  the  field  pattern  of  arbitrarily  oriented,  very  large 
planar  phase/time-delay  scanned  antenna  arrays  with  systematic  and  random  errors  is  presented  in  this  section. 
Such  development  is  based  on  the  known  principle  of  pattern  multiplication  [1]. 

2.1  Linear  Arrays: 

Consider  the  N-elements  linear  array  shown  in  Figure  1 .  The  array  axis  is  in  the  t  direction.  The  center  of 
the  array  coincides  with  the  origin  of  the  t-axis  and  it  is  specified,  with  respect  to  a  Cartesian  (global)  coordinate 

system,  by  the  position  vector  ^ .  For  an  even  number  of  elements,  the  elements  are  distributed  equally  on  each 
side  of  the  array.  For  and  odd  number  of  elements,  the  center  element  is  located  at  the  origin  of  the  t-axis  with  an 
equal  amount  of  elements  on  each  side.  The  distance  between  any  two  consecutive  elements  is  constant  and  it  is 

represented  by  .”  The  elements  are  considered  to  be  identical,  with  identical  magnitude  and  progressive  phase 

shift  between  consecutive  elements.  Such  array  can  be  referred  to  as  a  uniform  array. 

According  to  the  method  of  pattern  multiplication  the  far-field  pattern  of  an  array  of  identical  elements  is 
equal  to  the  product  of  the  element  pattern  and  the  array  factor  of  the  array.  The  element  pattern  is  the  field  of  a 
single  element  computed  at  a  reference  point  -usually  the  origin  of  the  array.  The  array  factor  is  a  function  of  the 
number  of  elements,  their  geometrical  arrangement,  their  relative  magnitude  and  phases,  and  their  spacing. 
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Figure- 1  N-Elements  Linear  Array 


Thus,  the  transmitting  field  pattern  of  the  array  of  Figure- 1  may  be  written  as: 

Fn{G.(I>)  =  eNiOSfNi^S  •  (1) 

Where  e  NiO^cfi)  represents  the  element  pattern  with  random  error  and  can  be  expressed  as  [2], 

1  "1"  ^  ^  * 


is  the  no-error  element  pattern,  5  are  samples  of  a  random  variable  with  zero  mean  and  variance  ^  ,  and 

the  factor  a:  accounts  for  missing  elements  such  as  might  be  caused  by  catastrophic  failure.  As  in  [1],  it  is  assumed 
that  K  represents  the  fraction  of  elements  that  remain  when  l-AT  elements  have  failed.  On  the  other  hand, 

/v  represents  the  array  factor.  For  now,  we  will  assume  isotropic  elements  with  =  0.0 ,  and  AT  =  1.0 


so  that  e ^{0^(f>)  =  =  1.0.  Now,  (0^(f>)  becomes  the  entire  radiation  pattern.  Thus, 

«=i 

In  Equation  (2) 

P^Ik! X 


(2) 

(3) 


where  X  is  the  wavelength.  /'  is  the  of  the  excitation  with  random  error  at  the  element  which  can  be  expressed 
as  [2], 


Sa„) 


(4) 
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j/^Jand  the  no-error  magnitude  and  phase  of  the  excitation.  TheS  as  well  as  the  ^cr,,are  samples  of 

random  variables  assumed  to  have  a  normal  distribution  with  zero  mean  and  variance  and  (J^sa„, 
respectively.  The  direction  of  transmission  is  given  by, 

w=cos^,  sin^,jc+sin^,  sin^^  j)+cos^,  z  .  (5) 

The  distance  vector  from  the  ^  element  to  the  observation  point  P  is  denoted  by  r„  which  can  be  expressed  as, 

Kg  ■  (6) 

Fq  is  the  position  vector  of  the  observation  point  P,  while  is  the  position  vector  of  the  n  element  with 
respect  to  the  origin  of  the  Cartesian  coordinate  system  and  can  be  expressed  as, 

Kg=rc  +  Ki- 
Where, 

r^=x^x  +  y^y->i-z^z  (8) 

and  F'^i  is  the  position  vector  of  the  element  with  respect  to  the  local  coordinate  system  n  =  t  xv  with  origin 
at  the  center  of  the  array.  In  terms  of  their  scalar  components  along  the  Cartesian  unit  vectors,  the  unit  vectors  along 

the  t,  V  and  n-axis  are  f  =  X  +  j)  -f  f ,  v  =  x  +  j)  +  v,  z ,  and,,  h  =  n^X  +  +  n^z , 

respectively. 

Using  the  equations  described  above,  and  making  the  usual  far-field  approximations. 

Equation  (2)  becomes. 


«  =  1 

Where,  |/;,|=  (\  +  SI„)\l„\, 


sin  d,  cos  (!>,  +y'„  sin  Oj  sin^,  +z'„  cosO, ) 


yh=  I  “  /2  +  I  ^  +  /(O  . 


^«=  /2  +^v  +/(0Wz* 

In  Equations  (1 1)  -  (13),  is  the  length  of  the  linear  array  along  the  t-axis.  In  arriving  at  these  equations,  it  was 
assumed  that  the  location  of  the  elements  is  a  random  variable  with  independent  distributions  along  the  transversal 
and  vertical  directions.  Both  distributions  are  assumed  to  be  normal  with  mean  equal  to  the  “real”  location  of  the 

elements  and  with  a  respective  standard  deviation  andCT^  .  The(^,  and  represent  samples  of  such 
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distributions.  In  the  code,  andcT^^ ,  are  specified  as  inputs.  f(t)  is  the  flexing  of  the  array  aperture  along  the 

t-axis.  Here,  it  is  assumed  that  f(j)  varies  linearly  from  the  center  to  the  edge  of  the  array.  The  slope  of  this 
variation  is  an  input  to  the  code  also. 


2.1.1  Beam  Scanning: 

Two  different  options  for  beam  scanning  were  incorporated  into  this  simulation.  These  are  phase  scanning 
and  true-time  delay  scanning.  These  may  be  used  for  scanning  the  main  beam  of  the  field  radiated  by  either  the 
individual  panels  as  well  as  that  of  the  array  of  panels.  Both  options  are  discussed  presently. 

2.1. 1.1  Phase  Scanning: 

By  prescribing  the  elements  with  a  progressive  phase  shift  given  by, 

,  (14) 

it  is  possible  to  point  the  main  beam  in  a  given  direction,  say  Uq  .  Equation  (9)  can  be  written  now  as, 

.  (15) 

w=l 

The  above  expression  produces  a  maximum  radiation  in  a  direction  given  by, 

Uq  =  smdQ  cos^qX  +  sin^o  siik/^q  y  +  cosO^z.  (16) 


2.1. 1.2  True  Time-Delay  Scanning: 

The  following  description  is  a  generalization  of  the  description  found  in  [3].  In  the  true  time-delay  method, 

the  progressive  phase-shift  is  achieved  by  using  N-bit  “time-shifters”  as  opposed  to  phase  shifters.  For  the 
element,  this  is  achieved  by  propagating  the  excitation  of  the  element  through  a  delay  line  whose  length  is  designed 
to  provide  a  time  delay  given  by, 

4  (<^o.«>o)  =  (Wo  (17) 

where  c  =  speed  of  light  in  free-space.  For  all  frequencies  ry  =  2  ;r  /,  where  /  is  the  frequency  of  the  excitation 
in  Hertz,  the  phase  of  the  excitation  is  now  given  by, 

A= (18) 

Substituting  Equation  (18)  into  Equation  (15),  we  obtain, 

(,9) 

H  =  1 

which  also  produces  a  maximum  radiation  in  the  direction  In  this  case,  however,  the  time-delay  factor  is 

independent  of  frequency. 
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2.1.1.3  Phase/Time-Delav  Quantization  Error: 

The  quantized  phase/time-delay  factor  results  from  the  use  of  N-bits  phase/time-delay  shifters.  The  number 
of  bits  N,  is  an  input  to  the  computer  program.  Due  to  the  quantization  process,  the  actual  phase/time-delay  factor 
realized  may  not  be  exactly  equal  to  the  specified  value.  The  resulting  quantization  error  may  be  expressed  as, 

Where,  5  is  the  error, is  the  specified  phase/time-delay  factor  as  given  by  either  Equation  (14)  or  Equation 


(18),  and  is  the  quantized  phase  time/delay  shift  factor. 

depending  on  weather  P^  is  smaller  or  larger  than  P^ . 
Equation  (9)  becomes, 

p  _  ^ |y f  I  ^jSa„  ^  j\p {^n  sin cos sin 0,  sin +  cos 0,)  +  ^, ] 

n=\ 


Note  that  S  P^  can  be  either  positive  or  negative 
Considering  the  phase/time-delay  quantizing  error, 

(21) 


2.1.2  Pattern  Synthesis: 

It  is  often  of  interest  to  achieve  a  narrow  main  beam,  accompanied  by  a  low  side  lobe  level.  One  of  the 
most  important  methods  is  the  Taylor  Line-Source  Design  Method  [4].  Taylor  perfected  this  method  for  continuous 
line  source  antennas.  Various  authors,  including  Villenueve  [5]  have  developed  several  approaches  of  applying 
Taylor’s  method  to  arrays  of  discrete  elements.  Villenueve  showed  that  for  discrete,  uniformly  spaced  arrays, 
exciting  the  array  elements  with  samples  of  the  continuous  distribution  produces  little  differences  in  the  excitations. 
Thus,  the  pattern  realized  by  exciting  the  elements  with  samples  of  the  continuous  distibution,  is  unditinguishable 
from  the  pattern  realized  by  exciting  the  elements  with  the  corresponding  distribution  for  discrete  arrays.  The 
agreement  improves  for  larger  arrays.  Here  we  use  the  sampling  method  to  compute  the  excitation  of  the  elements. 

2.2  Array  Panels: 

In  this  section,  the  pattern  of  a  planar  N  by  M  elements  array  is  discussed.  The  planar  array  is  shown  in 
Figure-2.  Such  array  may  be  constructed  by  aligning  M  linear  arrays-of  the  type  described  in  section  2.1-  along  the 

v-axis.  The  spacing  between  the  M  linear  arrays  along  the  v-axis  is  d^, .  Recalling  the  principle  of  pattern 
multiplication,  the  field  pattern  of  the  planar  array  can  be  expressed  as, 

In  this  case,  the  element  pattern  is  the  field  pattern  of  a  linear  array  given  by  Equation  (1)  and  re-stated  here  as: 

eA0,^)  =  eli0J)fA0J)-  (23) 

Thus, 

p4) 

Where,  /yy  is  given  by  Equation  (2)  (for  isotropic  elements)  and 
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Figure-2  N  by  M  Elements  Planar  Array 


M 


(25) 


m=\ 


In  Equation  (25),  |/'J  is  the  magnitude  of  the  excitation  current  with  error  at  the  element  given  by, 

Kl=  ('  +  <S/.)|/.|  .  (26) 

I^Hil  is  ilie  magnitude  of  the  no-etror  excitation  and  S  /„  are  samples  of  a  notmal  distribution  with  zero 

mean  and  variance  ^ •  Also,  5  is  the  random  error  in  the  phase  of  the  excitation.  Here,  is  the  quantized 
phase/time-delay  factor,  which  can  be  computed  in  a  manner  similar  to  the  one  described  in  section  2.1.1.  The 

phases  and  are  independent,  but  usually  adjusted  so  that  the  main  beam  of  is  directed  in  the  same  direction. 
Finally, 

<)  =  ^c+  +('”-!)  j  V,  +  ^,  V,  +  /(v)  n, ,  (27) 

(28) 

(29) 


y'm  =  yc+  ;>2  +  ('"-O  <  Vy+^,  ty  +^„  +/(v)«^. 


=  •^c+  /^  +  (W  - 1)  V,  +  S,  t,  +  V,  +  /(v)  «. . 


Z,,  is  the  length  of  the  array  along  the  v-axis  and  / (v)  is  the  flexing  of  the  array  aperture  along  the  v-axis.  It  is 
assumed  that  / (v)  varies  linearly.  The  remaining  variables  were  defined  in  previous  sections.  We  can  consider 
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elements  which  are  not  isotropic  in  which  case  e{9^  (p)  is  directive.  Included  in  the  code  are  three  different  options 
for  the  element  pattern.  These  are  isotropic,  cosine  squared  and  cardiod  element  patterns. 

2.3  Array  Of  Panels: 

The  procedure  described  in  section  2.2  can  be  used  to  compute  the  pattern  of  an  array  of  N  by  M  panels.  In 
this  case,  the  individual  array  elements  shown  as  dots  in  Figure-2  are  panels.  The  element  pattern  of  the  array  of 
panels  is  the  field  pattern  of  a  reference  panel.  The  pattern  of  a  panel  may  be  computed  using  the  procedure 
described  in  section  2.2  also.  This  can  be  expressed  simply  as, 

(30) 

is  the  field  of  the  array  of  panels,  (0,  <f>)  is  the  element  pattern  of  a  reference  panel  given  by  Equation  (25) 

and  is  the  array  factor  of  the  array  of  panels  which  can  also  be  computed  as  in  Equation  (25).  These 

ideas  were  used  here  to  write  a  code  capable  of  computing  the  field-pattern  of  an  N  by  M  panels  antenna  array. 

3.0  Results: 

The  purpose  of  this  section  is  twofold.  First,  it  serves  as  a  validation  of  the  results  obtained  with  the  code 
described  here.  Second,  it  serves  as  an  implicit  demonstration  of  the  potential  of  the  code.  For  the  sake  of 
completeness,  patterns  for  linear  arrays,  array  panels  and,  an  array  of  panels  are  included  here.  Some  results  for 
directivity,  gain  and  Sidelobe  level  are  also  included.  All  patterns  are  computed  at  increments  of  half  a  degree  in 
elevation  and  a  degree  in  azimuth  to  insure  convergence.  The  linear  arrays  are  oriented  along  the  x-axis  so  that  the 
t-axis  coincides  with  the  x-axis.  The  array  panels  are  oriented  in  the  x-y  plane  so  that  the  t-axis  and  the  v-axis 
coincide  with  the  x-axis  and  y-axis,  respectively.  All  patterns  are  computed  in  the  x-z  plane.  The  random-error 
patterns  shown  here  were  computed  using  the  following  data:  RMS  amplitude  error  =  0.002  units. 
RMS  phase  error  =  10.00  degrees,  RMS  error  in  the  elements  location  in  both  x  and  y  directions  =  0.002  cm,  and 
RMS  error  in  element  pattern  =  0,0.  Unless  stated  otherwise,  the  element  spacing  is  .5  cm,  which  corresponds  to  a 
A  /  2  element  spacing  at  30  GHz.  These  values  were  chosen  accordingly  to  examples  found  in  the  literature  [6]. 

3.1  Linear  Arrays: 

Patterns  shown  in  Figure-3  correspond  to  the  normalized  field-pattern  of  a  20-elements  linear  array.  Two 
graphs  are  shown  in  Figure-3.  The  solid  line  corresponds  to  the  normalized  pattern  of  the  array  with  uniform 
illumination.  Experience  as  well  as  results  found  in  the  literature,  validate  the  field  pattern  of  this  uniformly  excited 
array.  For  instance,  the  directivity,  gain,  and  sidelobe  level  were  computed  for  this  array  also  with  the  code 
described  here.  The  directivity  is  10,  the  gain  10  dB,  and  the  sidelobe  level  -13.3  dB.  These  quantities  further 
validate  the  results.  The  dotted  line  corresponds  to  a  Taylor  synthesized  pattern  with  -25  dB  sidelobe  level 
(«  =  5  ).  The  Taylor  weightings  where  validated  with  results  found  in  the  literature  [4].  The  gain,  directivity,  and 
sidelobe  level  for  this  case  array  are  9.10,  9.59  dB,  -25.1  dB  as  expected.  Figure-4  shows  the  pattern  for  the  same 


24-9 


20-elements  Taylor  sinthesized  array  scanned  to  45° .  In  this  figure,  the  solid  line  represents  the  no-error  while  the 
dotted  line  corresponds  to  the  error-pattern.  In  computing  this  pattern  only  random  errors  were  considered. 


Elevation  Angle  Theta  (Degrees)  Ele\etion  Angle  Theta  (Degrees) 


Figure  3.  Pattern  for  a  20-eletnents  linear  array  with  uniform  Figure  4.  Pattern  for  a  scanned  20-elements  linear  array  with 

illumination  (solid).  -25  dB  design  sidelobe  level  (dotted)  uniform  illumination  (solid),  -25  dB  design  sidelobe  level  (dotted) 


3.2  Array  Panels: 


In  this  section,  a  number  of  no-random-error  as  well  as  random-error-  patterns  are  shown.  The  error- 
patterns  are  validated  against  examples  found  in  the  literature  [6],  However,  these  examples  correspond  to 
average  patterns  in  a  statistical  sense.  Meanwhile,  the  patterns  shown  here  are  computed  considering  actual 
random  variations.  The  validation  process  consists  of  confirming  various  trends  in  tolerance  analysis  reported 
in  the  literature  [6].  For  a  given  set  of  tolerances,  these  trends  are;  (1)  the  rise  in  sidelobe  level  due  to  random 
error  increases  as  design  sidelobe  level  decreases;  (2)  the  pattern  deterioration  decreases  as  the  array  is 
enlarged;  (3)  the  sidelobe  level  increase  due  to  random  error  does  not  depend  on  scan  angle;  (4)  pattern 
deterioration  is  larger  for  an  L  elements  linear  array  than  for  an  L  X  L  elements  array;  and  (5)  the  pattern 
deterioration  is  mostly  a  result  of  translational  errors  in  the  position  of  the  elements.  A  number  of  patterns  were 
plotted  to  examine  these  trends.  These  are  shown  in  Figures  (4)  to  Figure  (8).  In  these  figures,  the  no-error 
patterns  are  plotted  with  solid  line  while  the  error-patterns  are  plotted  with  a  dotted  line. 

Figure-4  (a)  shows  the  pattern  for  a  10  X  10  elements  planar  array  with  -25  dB  design  sidelobe  level 
(«  =  5  ),  while  Figure-4  (b)  show  the  pattern  for  the  same  planar  array  but  with  a  —40  dB  («  =  10 )  design 
sidelobe  level.  The  rise  in  the  sidelobe  level  is  readily  noticed.  Thus,  confirming  the  first  trend. 

The  second  trend  may  be  examined  through  Figure-5.  Figure-5  (a)  shows  the  pattern  for  a  12  X  12 
elements  array,  while  Figure-5  (b)  shows  the  pattern  for  a  20  X  20  elements  array.  Both  arrays  have  a  -30  dB 

(n  =  7)  Taylor  distribution.  It  is  obvious  that  the  rise  in  the  sidelobe  level  is  smaller  for  the  larger  array. 
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Magnitude  (dB)  Magnitude  (dB) 


(a)  (b) 

Figure-4.  Error  (dotted)  vs.  No-Error  (solid)  Pattern  for  10  by  10  elements  planar  array  with  (a)  -25  dB  Taylor  Distribution  (b)  -40  dB  Taylor  Distribution 


(a)  (b) 

Figure-5.  Error  (dotted)  vs.  No-Error  (solid)  pattern  for  a  (a)  12  by  12  elements  array  (b)  20  by  20  elements  array  with  -30  dB  Taylor  Distribution 


Figure-6  Error  (dotted)  vs.  No-Error  (solid)  pattern  for  a  15X15  elements  array  scanned  (a)  scanned  10  degrees  (b)  40  degrees  from  broadside 
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Figure-6  considers  the  third  trend.  In  this  case,  the  pattern  of  two  15  X  15  elements  arrays  are  shown. 
Each  array  has  a  —25  dB  (  «  =5)  Taylor  distribution.  A  close  look  at  the  figure  might  suggest  that  in  the  average 
the  sidelobe  level  variation  be  about  the  same  in  both  cases.  To  examine  the  fourth  trend,  we  consider  Figure-2 
and  Figure-7.  Figure-2  shows  the  error  vs.  no-error  pattern  for  a  20  elements  linear  array  with  a  -25  dB  (  «  =5) 
Taylor  distribution,  while  Figure-7  shows  the  error  vs.  no-error  pattern  for  a  20X20  elements  array  with  the 
same  illumination.  Examination  of  the  two  figures  confirms  the  trend.  Finally,  in  Figure-8,  the  error  pattern 
considering  translational  errors  only  emd  the  error  pattern  considering  excitation  errors  only  are  both  plotted  vs 
the  no-error  pattern.  The  array  considered  here  is  a  10  XIO  elements  array  with  a  -25  dB  (n=5)  Taylor 
Distribution.  The  solid  line  corresponds  to  the  no-error  pattern  while  the  dotted  line  corresponds  to  the 
translational  errors  only  pattern.  The  dashed  line  corresponds  to  the  excitation  errors  only  pattern.  This  plot 
superimposes  precisely  over  the  no  error  pattern.  Thus,  confirming  the  fifth  trend. 


Figure  7.  Error  (dotted)  vs.  No-Error  (solid)  pattern  for  20X20  planar  array  Figure-8  Error  (dotted/dashed)  vs.  No-Error  pattern  for  10X10  planar  array 


For  completeness,  this  section  is  concluded  with  a  figure  showing  the  “systematic-error-pattem”  vs  no¬ 
error  of  a  20X20  elements  array  with  —25  dB  Taylor  distribution.  The  systematic  errors  considered  here  are  flexing 
of  the  face  of  the  array  and  N-bit  quatization  phase/time-delay  shift  error.  The  flexing  of  the  array  is  assumed  to 
vary  linearly  from  the  center  to  the  edge  of  the  array.  In  the  results  shown  in  Figure-9  the  slope  of  the  linear 
variation  was  chosen  so  that  the  maximum  flexing  would  be  0.002  cm.  Also,  3-bit  phase-shifters  are  used  to  steer 
the  main  beam.  It  appears  as  if  systematic  errors  had  a  slightly  less  impact  on  the  array  pattern  than  random  errors. 


Figure-9  Systematic-Errors  vs.  No-Error  Pattern  for  a  20  X  20  elements  anay 
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3.3  Array  of  Panels 

In  this  case,  we  consider  the  field  pattern  of  an  array  of  panels.  The  specific  array  considered  here  is  based 
on  the  low-altitude  space-based  radar  (LASBR)  described  in  the  literature  [7],  The  LASER  is  a  13.8tn  X  63.3m 
planar  array.  It  consists  of  49,152  elements  distributed  over  32  array  panels.  The  specific  arrangement  of  the 
panels  as  well  as  that  of  the  elements  within  a  panel  is  unknown.  The  gain  for  LASBR  is  53  dB.  It  operates  at  a 
center  frequency  of  1.275  GHz.  The  array  of  panels  considered  here  is  a  13.1m  X  63.3m  array.  It  consists  of 
48,960  elements  distributed  over  32  panels.  The  panels  are  arranged  in  an  array  of  4  X  32  panels,  with  90  X  17 
elements  per  panel.  The  gain  for  this  array  is  38.8  dB.  It  is  considerably  lower  that  the  LASBR,  but  these  two 
arrays  are  not  exactly  the  same.  The  array  considered  here  also  operates  at  1.275  GHz.  Figure-9  shows  the  no¬ 
error  pattern  for  the  array  this  array.  Figure-9  shows  the  “random-error-patter.”  The  patterns  are  computed  in 
the  y-z  plane.  A  considerable  increase  in  sidelobe  level  may  be  observed  in  the  error  case.  The  RMS  error  in 
element  location  is  2mm,  which  may  account  for  the  considerably  large  increase  in  sidelobe  level. 


Figure- 10.  (a)  No-Error  Pattern  (b)  Error  Pattern  for  a  4X32  panel  array 


4.0  Concluding  Remarks: 

A  computer  simulation  to  determine  the  field-pattern  of  arbitrarily  oriented  very  large  phase/time  delay 
scanned  antennas  was  developed.  The  simulation  takes  into  account  errors  present  in  antenna  arrays.  These  are 
systematic  and  random  errors.  The  systematic  errors  considered  here  are  the  finite  quantization  of  the  phase 
produced  by  the  use  of  digital  N-bits  phase/time-delay  shifters  and  the  flexing  of  the  array  aperture  due  to  its  large 
size  and  weight.  The  random  errors  considered  are  those  caused  by  variations  on  the  amplitude  and  phase  of  the 
elements  current,  variations  on  the  radiation  pattern  of  the  elements,  and  missing  elements  (due  to  catastrophic 
failure),  and  variations  in  the  location  of  the  elements.  To  validate  the  simulation  a  number  of  patterns  were 
computed.  These  included  patterns  of  linear  arrays,  array  panels,  and  arrays  of  panels.  Experience  as  well  as 
specific  examples  validated  the  ideal  patterns.  The  “random-error-pattems”  were  compared  to  specific  trends  noted 
in  earlier  studies.  The  behavior  of  the  computed  patterns  confirmed  such  trends. 
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A  simulation  like  this  is  a  useful  tool  for  assessing  the  effect  of  tolerances  on  the  performance  of  an  antenna 

array.  It  can  also  be  used  as  a  design  tool  to  determine  required  design  parameters  necessary  to  achieve  a  desired 

sidelobe  level.  There  are  some  areas  where  the  code  could  be  improved.  For  instance,  other  types  of  element 

patterns  could  be  considered.  Moreover,  the  simulation  could  be  modified  to  accept  measured  element  patterns  data. 
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FLUTTER  PREDICTION  METHODS  FOR  AEROELASTIC  DESIGN  OPTIMIZATION 


Ramana  M,  Pidaparti 
Associate  Professor 
Department  of  Mechanical  Engineering 
Purdue  University  at  Indianapolis  (lUPUI) 


Abstract 


In  the  design  of  flight  vehicles,  dynamic  flutter  instability  is  a  critical  parameter  that  must  be  considered  in 
various  design  phases.  Due  to  the  numerically  intensive  nature  of  design  optimization,  accurate  and  efficient  method 
of  flutter  prediction  is  needed.  The  objective  of  the  report  is  to  review  and  investigate  the  flutter  prediction  methods 
for  realistic  examples  used  in  aircraft  preliminary  design.  Specifically,  demonstrate  the  flutter  predictions  with 
recently  developed  method  using  Eigenvector  Orientations  (EVO).  Comparisons  of  flutter  predictions  between  the 
popular  V-g  method  and  Eigenvector  Orientation  method  are  presented.  The  examples  studied  include  a  sweptback 
untapered  wing,  jet  transport  wing/aileron,  and  intermediate  complexity  wing.  Results  presented  illustrate  that  the 
EVO  method  can  predict  the  onset  of  flutter  for  aircraft  wings  used  in  design  optimization  studies.  Based  on  the 
results  obtained,  the  EVO  method  shows  the  feasibility  of  automating  the  flutter  prediction  process  and  achieving 
real  time  control. 
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FLUTTER  PREDICTION  METHODS  FOR  AEROELASTIC  DESIGN  OPTIMIZATION 


Ramana  M.  Pidaparti 


Introduction 


Design  of  modem  flight  vehicles  requires  multidisciplinary  design  optimization,  MDO  (integration  of 
stmctures,  aerodynamics,  control  and  propulsion  disciplines)  to  achieve  an  optimum  design  based  on  certain 
performance  criteria  and  constraints.  In  the  design  of  flight  vehicles,  flutter  is  a  critical  parameter  that  must  be 
considered  in  the  early  stage  of  design  cycle.  Also,  with  the  aging  aircraft,  certain  restrictions  need  to  be  imposed  to 
avoid  flutter  instability  during  the  flight  regime  of  the  aircraft.  A  large  number  of  parameter  changes  must  be 
investigated  in  the  flutter  design  of  military  as  well  as  commercial  aircraft  due  to  complexity  of  modem 
composite/metallic  stmctures.  Due  to  the  numerical  nature  of  search  optimization  in  MDO,  accurate  and  efficient 
methods  for  flutter  prediction  are  needed  for  design  applications. 

There  are  several  aeroelastic  design  optimization  codes  being  used  for  aircraft  and  spacecraft  stmctures 
such  as  Automated  Stmctural  Optimization  System  (ASTROS)  [1],  and  MSC-NASTRAN  [2].  In  aeroelastic  design 
optimization,  predominant  dynamic  flutter  instability  needs  to  be  investigated  for  most  flight  vehicles.  Several 
methods  are  being  used  to  predict  the  onset  of  flutter  instability  in  aeroelasticity  design  optimization.  Most  of  these 
methods  are  based  on  complex  eigenvalues  (p-k  method,  K-method  or  KE  method)  [2].  Recently,  Afolabi,  Pidaparti 
and  Yang  [3]  developed  a  method  based  on  eigenvector  orientations  (EVO)  to  predict  the  onset  of  flutter  instability. 
In  their  work,  the  method  of  eigenvector  orientations  was  demonstrated  for  a  limited  examples  of  panel  flutter.  To 
gain  a  better  understanding  of  the  eigenvector  orientation  method  of  flutter  prediction,  further  investigations  need  to 
be  carried  out  on  realistic  aircraft  stmctures.  This  understanding  will  help  to  devise  a  prediction  methodology  that 
may  be  implemented  in  MDO  codes  for  preliminary  design  of  flight  vehicles. 

The  objective  of  this  report  is  to  review  and  investigate  the  flutter  prediction  methods  for  realistic  examples 
used  in  aircraft  preliminary  design  and  further  validate  the  eigenvector  orientation  method  [3].  A  brief  review  of 
commonly  used  flutter  prediction  methods  is  described  in  the  next  section.  Three  different  examples  are  studied  and 
the  results  of  flutter  prediction  are  compared  between  V-g  method  and  EVO  method.  These  comparisons  may  serve 
as  a  benchmark  for  predicting  the  onset  of  flutter  condition  for  design  applications. 
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Flutter  Prediction  Methods 


The  aeroelastic  system  of  equations  for  an  aircraft  structural  component  can  be  written  as 

[M]{x}  +  [C]{x}  +  ([/f]+  J){x}  =  {0}  (1) 

where  [M]  is  the  mass  matrix,  [K]  is  the  stiffiiess  matrix,  [C]  is  the  damping  matrix  due  to  aerodynamic  or  stmctural 
somces,  and  [K^]  is  the  unsteady  aerodynamic  stiffness  matrix.  The  above  system  of  equations  can  be  solved  as  a 
complex  eigenvalue  problem  by  assuming  the  unknowns  {x}  to  follow  a  harmonic  motion.  For  a  large  system, 
normal  modes  approach  can  be  used  to  reduce  the  aeroelastic  system  of  equations.  Usually,  a  free  vibration  analysis 
(by  neglecting  the  Hamping  and  aerodynamic  stiffness)  is  performed  to  obtain  the  natural  frequencies  and  mode 
shapes.  Then,  using  a  sufficient  number  of  lowest  modes  (usually  6-12  modes),  the  original  aeroelastic  system  (Eq. 
1)  can  be  reduced  as  follows, 

[m]{q}  +  [c]{q}  +  ([^]  +  [^  =  {0}  (2) 

where  [m],  [c],  [k],  and  [kj  are  the  modal  mass,  damping,  stmctural  stiffness  and  aerodynamic  stiffness  matrices, 
respectively.  The  system  of  equations  in  Eq.  (2)  results  in  a  complex  eigenvalue  problem  corresponding  to  a  given 
value  of  reduced  frequency  (k)  or  aerodynamic  parameter  (q).  The  flutter  boundary  is  obtained  when  the  lowest  two 
eigenvalues  coalesce  at  a  critical  value  of  k  or  q  (assuming  no  aerodynamic  damping),  or  the  damping  (g)  of  one  of 
the  modes  is  zero  when  crossing  from  negative  (stable)  to  positive  (unstable)  values.  Different  solution  techniques 
are  being  used  in  the  literature  to  obtain  the  critical  flutter  boundary  (flutter  speed  and  frequency)  which  are  briefly 
described  below. 

K-  Method  (V-g  analysis) 

In  the  K-method  of  flutter  solution,  the  complex  eigenvalue  problem  for  V-g  aeroelastic  analysis  can  be 
written  as. 


{(l  +  ig)[^]-®  '(['«]+ [^])}{<l>}  =  {0} 


(3) 


where  [k]  and  [m]  are  the  modal  stiffness  and  mass  matrices,  and  [A]  is  the  complex  aerodynamic  matrix  derived 
from  some  aerodynamic  theory,  for  example,  doublet  lattice  method  for  subsonic  flight  regime  [5];  constant 
pressure  panel  theory  [6]  or  piston  theory  [7]  for  supersonic  flight  regime.  In  equation  (3),  the  parameter  g  is  the 
artificial  stmctural  damping,  and  co  is  the  frequency.  Rewriting  the  above  equation  as. 
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{[^]-X([m]+[^])}{(t)}  =  {0} 


(4) 


where  X  —  G)V(H-ig)  is  the  complex  eigenvalue  and  {(j)}  is  the  complex  eigenvector.  The  complex  eigenvalue  can  be 
written  as, 


X  =  a  ’h  ib 

where  the  damping,  g  is  defined  as  (-b/a)  and  the  velocity  V  is  given  by 


(5) 


(6) 


The  frequency,  f  (Hz)  is  given  in  terms  of  reduced  frequency,  k  by 

nc 


(7) 


In  the  K-method  flutter  solution,  the  complex  eigenvalue  problem  is  solved  for  each  reduced  frequency  (k) 
or  velocity  (V)  until  the  flutter  condition  is  obtained  (damping  becomes  zero).  The  damping  values  (g)  for  each 
mode  is  plotted  against  the  airspeed  (V)  in  a  V-g  plot.  The  flutter  reduced  frequency  (kf)  is  obtained  when  the 
damping  of  one  of  the  modes  is  zero  when  crossing  from  a  negative  value  to  a  positive  value.  Damping  may  not  be 
a  single  valued  function  of  velocity  (V)  in  the  K-method  when  multiple  eigenvalues  exist  in  the  system.  When 
reduced  frequency  or  velocity  1/k  is  used,  the  damping  becomes  single  valued  function.  The  flutter  speed  (Vf)  is 
obtained  from  kf,  and  then  the  flutter  frequency  is  obtained  using  Eq.  (7).  Misidentification  of  flutter  stability  may 
be  obtained  if  mode  crossings  are  not  properly  tracked  during  flutter  analysis  [4]. 

The  K-method  has  been  extended  to  KE  method  in  MSC/NASTRAN  by  neglecting  the  damping  and 
solving  the  resulting  eigenvalue  problem  as  an  efficient  way  to  find  the  flutter  condition.  The  modes  are  extracted  in 
a  specific  order  and  tracked  to  determine  the  flutter  instability  when  damping  becomes  zero  for  one  of  the  modes 
similar  to  the  K-method.  This  method  does  not  extract  eigenvectors  from  the  flutter  analysis. 
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p-k  Method 


In  the  p-k  method  of  flutter  solution,  the  complex  eigenvalue  problem  for  V-g  aeroelastic  analysis  can  be 
written  as, 


(8) 


where 


X  =  (,V  !  by  =  (a  +  iY  (9) 

In  the  above  equation,  p  is  the  root  of  the  p-k  aeroelastic  analysis  equation  (8)  which  is  defined  as 
p=k(y+I),  with  Y  being  the  true  damping  in  the  system.  The  aerodynamic  matrix  [A]  in  the  p-k  method  differs  from 
that  of  the  K-  method  by  a  factor  which  is  given  as  2kVpbl  In  the  p-k  method  flutter  solution,  the  complex 
eigenvalue  problem  (Eq.8)  is  solved  for  each  velocity  (V)  until  the  flutter  condition  is  obtained.  The  damping  values 
(g)  for  each  mode  is  plotted  against  the  airspeed  (V)  in  a  V-g  plot  to  determine  the  flutter  velocity. 

An  iterative  loop  must  be  carried  out  to  match  the  p  root  for  the  [A]  matrix  in  terms  of  assumed  k  until  a 
convergence  is  achieved.  Convergence  in  terms  of  under-  or  overrelaxation  on  k  values  may  not  be  obtained  at  or 
near  the  flutter  boundaries.  A  more  robust  method  of  convergence  for  k  is  needed  for  use  in  commercial  software 
for  design  applications.  Also,  there  is  a  possibility  that  more  than  n  modes  at  a  given  airspeed  (V)  may  match  the  p 
root.  These  additional  modes  represent  aerodynamic  poles  rather  than  trae  roots  of  the  p-k  aeroelastic  equation. 
These  extraneous  roots  may  cause  problems  in  mode  tracking  algorithms  [4]. 

Both  the  above  two  methods  have  been  implemented  in  MSC/NASTRAN  software,  and  only  p-k  method 
has  been  implemented  in  ASTROS  software.  A  systematic  study  of  flutter  problems  by  Tischler  et  al.  [9]  revealed  a 
good  agreement  between  NASTRAN  and  ASTROS  softwares. 

Eigenvector  Orientation  (EVO)  Method 

Both  K-method  and  p-k  method  flutter  solutions  are  based  on  the  complex  eigenvalues  (tracking  zero 
values  of  modal  damping)  to  predict  the  onset  of  flutter  instability.  In  other  words,  when  the  damping  is  ignored,  the 
flutter  instability  is  found  when  the  lowest  two  eigenvalues  coalesce  and  become  complex  conjugate  pairs. 
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The  aeroelastic  system  of  equations  in  terms  of  airspeed  or  reduced  frequency  parameter  (Eqs.  4  or  8)  when  cast 
into  a  complex  eigenvalue  problem  can  be  solved  to  get  eigenvalues  and  eigenvectors.  Previous  studies  in  the 
literature  focussed  mostly  on  the  complex  eigenvalues  but  not  on  the  complex  eigenvectors.  Recently,  Afolabi, 
Pidaparti  and  Yang  [3]  used  the  idea  of  eigenvector  orientation  based  on  complex  eigenvectors  to  predict  the  onset 
of  flutter  instability.  The  eigenvectors  initially  real  and  orthogonal  to  each  other,  lose  their  orthogonality  at  the 
flutter  instability.  So,  when  the  angle  between  any  two  eigenvectors  deviates  from  90  degrees  indicates  the  extent  to 
which  the  aeroelastic  system  is  cloase  to  the  flutter  boundary.  Therefore,  in  the  EVO  method,  the  angle  between  any 
two  complex  eigenvectors  (any  modes),  or  its  deviation  from  90  degrees  or  when  the  rate  of  change  of  EVO  angle  is 
zero,  can  be  used  to  track  the  flutter  condition.  For  real  time  flutter  control  process,  one  can  use  the  EVO  method  as 
it  can  be  easily  monitored  the  EVO  angle  close  to  zero  indicating  the  flutter  condition.  In  addition,  the  EVO  method 
serves  as  a  complement  to  the  existing  methods  based  on  eigenvalues. 


The  angle  between  two  complex  vectors  is  calculated  by  mapping  a  complex  vector  into  a  real  vector  by 
grouping  the  imaginary  components  after  the  real  components.  For  example,  the  angle  between  two  eigenvectors 
((j)i  and  (t)2)  can  be  found  as, 


e 


12 


=  COS 


'4*2 


(10) 


The  angle  between  any  two  eigenvectors  is  90  degrees  initially  (orthogonal  modes)  and  approaches  zero  at 
the  flutter  instability  (modes  are  no  longer  orthogonal)  when  the  airspeed  (V)  or  reduced  frequency  (k)  is  varied. 
This  method  of  tracking  flutter  condition  is  completely  different  from  V-g  plot  using  K  and  p-k  methods.  The  EVO 
method  of  predicting  the  onset  of  flutter  instability  has  never  been  implemented  in  any  general  purpose  aeroelastic 
design  optimization  software. 

Applications 

Three  examples  were  selected  (see  Fig.  1)  to  demonstrate  the  flutter  prediction  based  on  EVO  analysis  as 
discussed  in  the  previous  section.  For  each  of  the  examples,  MSC/NASTRAN  software  was  used  to  calculate  the 
flutter  condition  based  on  V-g  plot  using  PK  method.  A  post  processing  program  was  developed  to  extract  the 
complex  eigenvectors,  and  determine  the  angle  between  them  by  the  EVO  method.  Results  of  flutter  boundaries  are 
presented  to  illustrate  the  relative  merits  of  EVO  method  compared  to  V-g  method  for  tracking  the  onset  of  flutter 
condition.  The  examples  studied  range  from  subsonic  to  supersonic  flight  regimes,  and  transport  aircraft  wing  and 
sweptback  fighter  type  wing  to  demonstrate  the  further  applicability  of  the  EVO  method  for  design  applications. 


25-7 


Example  1:  A  15-degree  Sweptback  Wing 


A  15“  swept  untapered  wing  model  (see  Fig.  1)  made  from  magnesium  which  was  tested  in  a  wind  tunnel  is 
analyzed  for  supersonic  flutter  analysis.  The  wing  is  modeled  using  28  thin  shell  QUAD4  elements  (40  nodes)  and 
is  fixed  at  the  root.  The  elements  representing  leading  and  trailing  edge  taper  in  thickness  to  zero  whereas  the  rest 
of  the  elements  have  a  thickness  of  0.041  inches.  For  the  aerodynamic  model,  the  wing  is  divided  into  four 
chordwise  regions  and  six  equal  width  strips.  Piston  theory  is  used  in  the  analysis.  More  details  about  stmcture  and 
aerodynamic  model  can  be  foxmd  in  Ref  [2].  The  flutter  analysis  results  are  presented  in  Figure  2.  The  results  of 
eigenvalues  and  damping  for  the  lowest  three  modes  are  in  given  in  Figs.  2  (a)  and  (b).  The  angle  between  the 
lowest  three  eigenvectors  (1,2  and  3)  is  presented  in  Fig.  2c.  The  V-g  plot  gives  a  flutter  velocity  of  2077  ft/s 
compared  to  an  experimental  value  of  2030  ft/s  [2].  The  flutter  velocity  predicted  from  EVO  method  (Fig.  2c)  is 
about  2067  ft/s  which  compares  well  with  experiments  as  well  as  V-g  analysis. 

The  same  15“  swept  wing  is  analyzed  for  low  supersonic  conditions  (M=1.3)  using  Mach  Box  lifting 
surface  aerodynamic  theory.  Twenty  control  points  were  specified  for  the  aerodynamic  model.  An  internal  surface 
spline  in  Nastran  interpolates  between  aerodynamic  grid  points  and  Mach  Box  centers.  The  calculated  aerodynamic 
forces  are  transferred  to  stmctural  nodal  points  with  a  linear  surface  spline.  The  flutter  analysis  results  are  compared 
in  Fig.  3.  It  can  be  seen  the  trend  for  EVO  angle  is  similar  to  the  previous  example.  The  flutter  velocity  predicted 
from  EVO  method  (Fig.  3c)  is  about  1630  ft/s  which  compares  well  with  V-g  analysis  (1582  ft/s)  using  NASTRAN 
and  1683  ft/s  using  ASTROS  softwares.  In  both  the  above  cases,  the  angle  between  the  two  eigenvectors 
corresponding  to  lowest  modes  reaches  a  minimum  (about  12“)  and  increases  again.  This  is  a  completely  different 
trend  from  the  panel  flutter  examples  showed  by  Afolabi,  Pidaparti  and  Yang  [3].  In  all  their  examples,  the  angle 
between  the  eigenvectors  reached  approximately  zero  condition  at  the  flutter  instability.  However,  in  the  present 
case,  the  flutter  condition  is  reached  when  the  two  modal  eigenvectors  are  not  completely  orthogonal  (close  to 
losing  orthogonality).  Therefore,  the  flutter  criteria  may  corresponds  to  a  situation  when  the  EVO  angle  reaches  a 
minimum  and  increases  again  (rate  of  change  of  EVO  angle  is  zero,  as  in  Fig.  3(b)). 

Example  2:  Jet  Transport  Wing/Aileron 

This  example  is  taken  from  MSC/NASTRAN  Aeroelastic  Analysis  manual  [2].  The  stractural  model 
consists  of  inputting  the  flexibility  influence  coefficients  at  10  grid  points.  The  grid  point  1 1  corresponds  to  the  rigid 
fuselage  at  the  root  of  the  wing  elastic  axis.  Strip  theory  aerodynamics  with  an  approximation  to  circulation  function 
was  used.  The  wing  is  idealized  into  58  aerodynamic  boxes  (five  for  fuselage  and  six  aileron)  for  aerodynamic 
model  in  the  flutter  analysis.  Figure  4  shows  the  results  of  flutter  speed  using  V-g  analysis  and  EVO  method.  The 
rate  of  change  of  EVO  angle  and  damping  is  presented  in  Fig.  4(b).  The  EVO  method  predicts  the  flutter  speed  of 
1 183  ft/s  whereas  V-g  method  predicts  the  flutter  speed  to  be  1 133  ft/s.  A  good  agreement  is  seen  in  this  case  too. 
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Example  3:  Intermediate  Complexity  Wing 


The  intennediate  complexity  wing  (ICW)  example  is  taken  from  ASTROS  software  and  has  been  used  in 
many  applications  in  MDO.  The  wing  geometry  is  given  in  Fig.  1.  The  geometry  of  ICW  is  30°  sweptback  wing 
with  upper  and  lower  skin  surfaces  consisting  of  balanced  composite  lay-up  of  (0°/±  45°/90°)  with  0°  fibers  aligned 
with  the  midspar  of  the  wing.  The  stractural  model  of  the  wing  (total  88  nodes)  consists  of  62  quadrilateral  and  2 
triangular  membrane  elements  (skins),  55  shear  panels  (ribs  and  spars),  and  39  rod  elements  (posts).  The  wing  is 
cantilevered  at  the  root.  For  aerodynamics,  the  wing  is  modeled  as  a  flat  plate  lifting  surface  with  72  boxes  (9 
spanwise  and  8  chordwise  with  imequal  spacing).  The  aero  stmctural  interconnection  is  defined  by  two  surface 
splines.  Flutter  analysis  is  carried  out  at  M  =  0.7  using  PK  method.  Figme  5  shows  the  results  of  flutter  speed  using 
V-g  analysis  and  EVO  method.  The  EVO  method  predicts  the  flutter  speed  around  800  ft/s  whereas  V-g  method 
predicts  the  flutter  speed  to  be  939  ft/s  and  926  ft/s  using  NASTRAN  and  ASTROS  softwares,  respectively.  The 
difference  in  flutter  speed  between  EVO  and  V-g  analyses  may  be  attributed  to  the  fact  that  the  present  finite 
element  model  is  not  accurate  eneough  to  give  good  mode  shapes.  It  will  be  interesting  to  remesh  ICW  and  see  how 
the  EVO  angle  predicts  the  flutter  speed. 

The  three  different  examples  studied  in  this  investigation  illustrate  that  EVO  method  of  flutter  prediction 
indeed  predicts  results  comparable  to  V-g  method  and  experiments.  The  question  now  is  to  address  the  possible 
benefits  of  the  EVO  method  as  compared  to  the  V-g  method.  In  the  EVO  method,  the  angle  reduces  from  90° 
monotonically,  definitely  reaches  a  minimum  and  increases  again.  If  one  is  to  use  real  time  automatic  tracking  of 
flutter  instability,  tracking  the  EVO  angle  from  90°  seems  an  easy  task  compared  to  tracking  damping  (which  could 
be  multiple  valued)  for  any  general  case. 

Concluding  Remarks 


Flutter  prediction  methods  based  on  complex  eigenvalues  is  reviewed  along  with  recently  developed 
method  based  on  complex  eigenvectors.  Comparisons  of  flutter  predictions  between  the  popular  V-g  method  and  the 
new  Eigenvector  Orientation  method  were  made  for  three  different  examples.  Good  comparison  of  the  flutter  speeds 
between  EVO  method  and  V-g  method  illustrate  the  applicability  of  EVO  method  for  realistic  aircraft  wings.  One 
limitation  of  EVO  method  of  flutter  prediction  may  be  the  mesh  dependancy  to  get  accurate  mode  shapes.  Based  on 
the  examples  studied,  the  EVO  method  shows  the  feasibility  of  automating  the  flutter  prediction  process  and 
achieving  real  time  control. 
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Fig.  2c  EVO  Angle  vareut  Velocity  Plot  for  15  deg.  Sweptbeck  Wing  under  High  Mach  Number 


Fig.  2  Comparison  of  Eigenvalue  Coalescence,  Damping  and  EVO  angle  for  15  degree  Sweptback  Wing 

(High  Supersonic) 
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Fig.  3(a)  Comparison  of  V-g  versus  V-EVO  angle  plots  for  15  degree  Sweptback  Wing  (Low 

Supersonic) 
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Fig.  3(b)  Rate  of  Change  of  Damping  and  EVO  Angle  with  Velocity  for  Low  Supersonic  15  Deg. 
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Fig.  3  Comparison  of  Damping  and  EVO  angle  and  their  rate  using  V-g  method  and  V-EVO  method  for  15  degree 
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25-13 


EVO  Angle  (Degrees) 


of  Change 


4.00E-02 


r  90 

-  80 

-  70 

-  60 

-  50 

-  40 

-  30 

-  20 

-  10 

- - -  -  -  -  0 

Velocity  (ft/s) 

Fig.  4(a)  Comparison  of  V-g  and  V-Evo  Angle  plots  for  Jet  Transport  Wing/Aileron 


Velocity  (ft/s) 
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Fig.  5  Comparison  of  Damping  and  EVO  angle  and  their  rate  using  V-g  method  and  V-EVO  method  for 

Intermediate  Complexity  Wing 


Characterization  of  BN-Doped  SiC  Epitaxial  Layers 


Stephen  E.  Saddow 
Assistant  Professor 

Emerging  Materials  Research  Laboratory 
Department  of  Electrical  &  Computer  Engineering 


Mississippi  State  University 
Box  9571 


Mississippi  State,  MS  39762 


Final  Report  for: 

Summer  Faculty  Research  Program 
Air  Force  Research  Laboratories 
Wright  Patterson  AFB 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Green  Air  Force  base,  DC 

And 

Air  Force  Research  Laboratories 
Wright  Patterson  AFB 


August  1998 


26-1 


Characterization  of  BN-Doped  SiC  Epitaxial  Layers 


Stephen  E.  Saddow 
Assistant  Professor 

Emerging  Materials  Research  Laboratory 
Department  of  Electrical  &  Computer  Engineering 
Mississippi  State  University 


Abstract 

The  electrical  properties  of  4H-SiC  epitaxial  layers  on  semi-insulating  substrates  were 
studied.  The  epitaxial  layers  were  grown  by  the  Emerging  Materials  Research  Laboratory 
(EMRL)  using  an  existing  cold-wall  chemical  vapor  deposition  (CVD)  system.  These  layers 
were  doped  during  growth  using  a  solid-phase  boron-nitride  source,  which  had  previously  been 
shown  to  dope  the  layers  with  boron,  nitrogen  and  aluminum.  Capacitance- Voltage  (C-V), 
cathodoluminescence  (CL),  Hall  effect,  thermal  admittance  spectroscopy  (TAS)  and  optical 
admittance  spectroscopy  (OAS)  were  used  to  characterize  these  layers.  CL  data  indicated  the 
presence  of  the  boron-related  D-center  (Ea  =  Ev  +  0.601f).2  eV),  which  has  a  characteristic  green 
luminescence  when  donor  to  D-center  recombination  occurs.  TAS  showed  evidence  of  a  deep 
level  but  sample  instabilities  prevented  complete  characterization  from  being  performed.  OAS 
data  was  gathered  and  compared  with  semi-insulating  substrate  material  from  Cree  Research  Inc. 
which  permitted  a  direct  comparison  between  observed  optically-active  defects  in  both  materials. 
Finally  Hall  data  indicate  the  possibility  of  hopping  conduction  in  slightly  p-type  layers,  while  in 
slightly  n-type  layers  there  was  a  measurable  mobility.  However,  characterization  of  thin  films 
on  insulating  substrates  using  the  Hall  Effect  was  inconclusive  and  will  require  further  study. 
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Characterization  of  BN-Doped  SiC  Epitaxial  Layers 
Stephen  E.  Saddow 


1.  Introduction 

Silicon  carbide,  despite  on-going  efforts  to  develop  a  reliable  implantation  doping  technology, 
remains  an  epitaxial  technology.  Chemical  vapor  deposition  (CVD)  remains  the  principal  means 
by  which  device  quality  SiC  semiconducting  films  are  obtained  with  specific  doping 
concentrations.  There  are  three  principal  dopants  for  "shallow"  donor  and  acceptor  doping,  they 
are  nitrogen,  aluminum,  and  boron.  Nitrogen  doping  for  donor  formation  is  a  relatively 
satisfactory  technology.  Acceptor  doping  favors  aluminum  because  aluminum  forms  a  shallower 
acceptor  level  than  does  boron.  However,  interest  remains  for  boron  acceptor  doping.  More 
recently,  suggestions  for  using  the  boron-related  D-center  deep-level  defect  (reported  to  lie 
between  Nv  +  0.58  eV  and  Nv  +  0.73  eV)  to  compensate  nitrogen  donors  to  produce  semi- 
insulating  SiC  epitaxial  layers  have  been  published.*  This  paper  provides  experimental  results  to 
suggest  that  codoping  SiC  with  boron  and  nitrogen  from  a  boron-nitride  solid  source  may 
provide  an  economical  means  for  achieving  semi-insulating  epitaxial  layers. 

The  Emerging  Materials  Research  Laboratory  (EMRL)  at  Mississippi  State  University  (MSU) 
has  been  developing  a  process  for  growing  semi-insulating  (SI)  epitaxial  layer.  This  technique 
involves  the  use  of  a  boron-nitride  (BN)  solid-source  dopant  scheme  whereby  radiant  heating  of 
a  BN  disk  is  sufficient  to  out  gas  boron,  nitrogen  and  aluminum  from  the  BN  disk.  These 
elements  are  then  transported  to  the  SiC  growth  region  via  a  hydrogen  carrier  gas,  the  result 
being  the  doping  of  the  resulting  epitaxial  layer  with  all  three  species.  Numerous  appUcations  for 
SI  SiC  epitaxial  layers  are  envisioned,  including  intrinsic  (i.e.,  “i”)  layers  for  p-i-n  diodes,  field 
termination  structures  for  high-voltage  power  devices^  and  buffer  layers  for  high-frequency 
devices. 

Experimental  results  have  been  reported  for  boron-doping  6H  silicon  carbide  epitaxial  layers 
from  a  sohd  boron-nitride  source.  Boron  doping  concentrations  from  more  than  10*®  cm'^  to  less 
than  10*^  cm‘^  were  observed.^  Doping  concentrations  were  correlated  with  the  temperature  of 
the  BN  source  during  CVD  growth  of  the  6H  SiC.  Donor  co-doping,  most  likely  from  nitrogen, 
was  also  observed,  in  some  cases  leading  to  substantial  compensation.  Unfortunately,  the  exact 
physical  mechanism  responsible  for  high  resistivity  of  this  material  cannot  be  extracted  when 
these  layers  are  grown  on  conducting  substrates.  Since  semi-insulating  (SI)  substrates  in  4H-SiC 
are  commercially  available,  the  EMRL  BN-doping  process  was  transitioned  to  SI  substrate 
material,  which  should  permit  a  more  accurate  assessment  of  the  BN-doped  film  to  be  made. 


2.  Epitaxial  Layer  Samples 

A  standard  silane/propane  based  precursor  growth  technology  was  used  to  grow  several  4H-SiC 
epitaxial  layers  on  4H-SiC  substrates.  A  50-mm  diameter  quartz  reaction  tube  with  an  outer 
water  jacket  to  permit  “cold  wall”  processing  was  used.  This  system  is  substantially  similar  to 
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that  developed  by  the  group  at  the  NASA  Lewis  Research  Center.  A  30-mm  silicon-carbide 
coated  graphite  susceptor  is  supported  on  a  quartz  boat  that  is  inserted  into  the  reaction  tube. 
One  or  more  1  cm  square  pieces  of  4H  silicon  carbide  rest  on  the  susceptor.  A  radio-frequency 
induction  coil  surrounds  the  quartz  reaction  tube,  which  heats  the  susceptor  to  the  growth 
temperature.  For  the  results  reported  here,  the  growth  temperature  at  the  substrate  was  typically 
maintained  around  1550  C,  as  measured  with  an  optical  pyrometer  and  corrected  for  absorption 
through  the  quartz.  The  4H  substrate  pieces  were  cut  from  SI  Cree  wafers  polished  on  one  side 
(Si  face)  which  were  provided  by  Dr.  W.  Mitchel  of  the  AFRL  Materials  Laboratory.  Prior  to 
insertion  into  the  reactor,  the  substrate  pieces  were  cleaned  ultrasonically  with  solvents,  rinsed  in 
deionized  water,  and  blown  dry  with  nitrogen. 

Co-doping  with  boron  and  nitrogen  from  a  solid  source  was  provided  by  a  single  piece  of  boron 
nitride  cut  from  a  two-inch  wafer  specifically  manufactured  as  a  boron  diffusion  source  for  use 
in  silicon  processing.  The  trailing  edge  of  the  BN  was  placed  a  measured  distance  upstream  of 
both  the  susceptor  and  the  silicon-carbide  substrate  (see  Fig.  1).  The  substrate  was  typically 
between  10  and  15  mm  downstream  of  the  leading  edge  of  the  susceptor.  While  the  BN  was 
never  placed  on  the  heated  susceptor,  on  one  occasion  it  was  allowed  to  touch  the  upstream 
(“leading”)  edge  of  the  susceptor  (the  “0-mm”  case).  In  the  samples  prepared  for  this  SFRP 
effort,  the  BN  was  placed  3  mm  from  the  susceptor,  since  growth  runs  on  conductive  substrates 
indicated  this  situation  resulted  in  highly-resistive  material 

The  typical  growth  run  proceeded  as  follows.  After  loading  and  purging  with  ultra-high  purity 
(UHP)  argon  followed  by  UHP  hydrogen,  the  reaction  tube  was  evacuated  with  a  mechanical 
pump  several  times.  Each  time,  the  reaction  tube  was  back  filled  with  UHP  hydrogen,  and  then 
pumped  back  down.  Following  the  pump-purge  cycles,  a  steady  atmospheric-pressure  UHP 
hydrogen  flow  was  established  at  a  rate  of  3  1/m.  The  RF  power  source  was  then  engaged  and 
the  susceptor  rapidly  heated  to  the  growth  temperature  (typically  requiring  between  1  and  2 
minutes).  As  the  growth  temperature  was  reached,  flows  of  3%  silane  (SiHt)  in  hydrogen  and 
3%  (by  volume)  propane  (CsHg  )  in  hydrogen  commenced.  The  flow  rates  were  metered 
continuously  with  mass-flow  controllers,  and  were  set  to  15  seem  for  silane  while  the  propane 
flow  was  varied  to  set  a  desired  silicon-to-carbon  atomic  ratio  in  the  growth  tube.  (Experience 
with  epitaxial  layer  growth  on  conductive  substrates  indicated  that  a  silicon  to  carbon  ration  on 
the  order  of  0.16-0.18  should  yield  SI  epi.)  Growth  proceeded  for  75minutes  and  was  terminated 
by  the  following  purge  sequence  implemented  by  a  programmable  logic  controller:  The  CsHg 
and  SiH4  flows  were  terminated,  followed  by  30  seconds  of  UHP  Ha  flow  at  3  1/m  and  at  growth 
temperature,  followed  by  4  minutes  of  UHP  argon  flow  at  3  1/m  and  at  growth  temperature, 
followed  by  RF  power  shutoff  and  susceptor  cool  down  for  10  additional  minutes  in  UHP  argon. 
Upon  removal,  both  the  BN  pieces  and  the  SiC  pieces  were  inspected.  The  surfaces  of  the  SiC 
pieces  invariably  had  a  specular  or  near  specular  reflection,  indicating  good  quality 
homoepitaxial  layers.  Generally,  this  was  confirmed  by  optical  microscopy,  which  did  reveal  the 
usual  epitaxial-layer  morphological  imperfections,  some  of  which  were  clearly  related  to 
substrate  issues.  Except  in  the  cases  previously  noted,  the  BN  pieces  were  largely  unchanged. 
The  SiC  pieces  were  then  transported  to  the  AFRL  by  the  principal  investigator  for 
characterization,  the  focus  of  this  SFRP  effort. 
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Epitaxial  Sample  Summary 

Various  epitaxial  layers  were  grown  by  EMRL  for  characterization  during  the  SFRP  at  the 
AFRL.  The  details  of  the  epitaxial  layers  are  listed  in  Table  1  for  reference.  Each  sample  was 
grown  to  achieve  a  specific  objective  to  help  develop  a  full  understanding  of  the  electrical 
properties  of  EMRL  BN-doped  epitaxial  layers. 


Table  1  EMRL  SiC  Epitaxial  Layer  Growth  Summary 


Sample  No. 

Si/C 

Net  Doping  Density 
[cm"^]'^^ 

Net  Doping  Density 
[cm-^]^ 

Characterization 

Planned*^ 

MSU-98-4H-51 

0.16 

2-6  X  lO*'*  (p-type) 

— 

HaU,CV 

MSU-98-4H-52 

0.18 

~10'^ 

CV,  CL,  OAS,  TAS 

MSU-98-4H-53®^ 

0.18 

~10‘"^ 

- 

HaU 

MSU-98-4H-56 

0.18 

lO'^^-lO*^ 

10‘'‘-10‘^ 

HaU,  CV,  TAS 

MSU-98-4H-57^> 

0.20 

~10‘^ 

~10‘^ 

HaU,  CV,  TAS 

MSU-98-4H-69a 

0.20 

See  note  G 

See  note  H 

HaU 

MSU-98-4H-69b 

0.20 

(( 

a 

CV,  OAS,  TAS 

MSU-98-4H-70a 

0..18 

ti 

(( 

HaU 

MSU-98-4H-70b 

0.18 

(( 

n 

CV,  OAS,  TAS 

as  determined  by  Hg-probe  C-V  at  EMRL. 

a  degenerate  cap  layer  was  grown  in-situ  after  BN-doped  epi  growth. 

^  cap  layer  cone,  only  measured.  BN-doped  epi  cone,  should  be  identical  to  sample  52. 

’  BN  doped  with  x  =  3.5  mm. 
as  determined  by  Schottky  contact  C-V  at  AFRL 
^  estimated  epi  thickness  ~  2  pm  for  all  samples  except  MSU-98-4H-57  which  is  ~  3  pm 
Hg-probe  data  too  noisy  for  concentration  estimation 
CV  not  measured  at  AFRL  at  time  of  this  report 

Sample  Preparation 

Hall  Effect  and  Admittance  Spectroscopy  characterization  require  that  metal  contacts  be  placed 
onto  the  sample.  In  the  first  case,  Hall  Effect,  these  contacts  must  display  “ohmic-like”  behavior, 
which  can  be  achieved  in  one  of  two  ways.  The  classical  approach  is  to  select  a  metal  whose 
work  function  is  such  that  ohmic  behavior  is  achieved.  For  semiconductors  where  such  a  suitable 
metal  is  not  available,  ohmic  contact  is  achieved  via  the  use  of  a  heavily-doped  degenerate  layer 
which  ensures  that  a  Schottky  Tunneling  contact  is  realized.  Admittance  Spectroscopy  requires 
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only  a  Schottky  contact,  and  thus  a  degenerate  layer  is  not  required.  Cathodoluminescence  (CL) 
does  not  require  well-defined  contacts,  but  rather  contact  between  the  sample  surface  and  the 
low-temperature  cryostat  such  that  excess  charge  can  be  bled  off  of  the  sample  surface.  Indium 
solder  was  used  to  achieve  this  goal,  after  which  the  In  was  removed  using  a  HCL  and  Nitric 
Acid  dip. 

For  sample  MSU-98-4H-53  electrical  contact  was  formed  to  the  degenerate  layer  via  the 
deposition  of  1,000 A  of  Ni  which  was  subsequently  annealed  at  950°C  in  a  rapid  thermal 
annealing  (RTA)  system.  The  sample  was  then  loaded  into  a  reactive  ion  etching  (RIE)  system"* 
to  remove  the  N*  epitaxial  layer  between  the  Ni  contacts  (see  Figure  1).  A  total  of  0.5|j.m  was 
removed,  ensuring  that  the  electrical  conduction  between  the  Ni  contacts  was  due -to  the 
underlying  BN-doped  epitaxial  layer  only.  After  RIE,  the  electrical  behavior  between  the 
contacts  were  observed  to  be  that  of  a  pn  junction,  indicating  that  the  underlying  epitaxial  layer 
is  not  purely  SI  but  slightly  p-type. 


Figure  1  AFTT  RIE  system  used  to  etch  MSU  SiC  cap  layers,  (a)  Photo  of  system  with  sample 
loaded,  (b)  MSU-98-4H-53  sitting  on  parallel  plate  electrode  inside  RIE  system  after  etch.  TTie 

Van  der  Pauw  contacts  are  visible  in  the  image. 

Using  a  standard  ohmic  contact  recipe  developed  by  Scofield  et  al,^  contact  was  made  to  sample 
MSU-98-4H-56  via  the  DC  sputtering  of  Al/Ni/W/Au  of  2,000A/2,000A/2,000A/5,(X)0A, 
respectively.  A  shadow  mask  was  used  to  pattern  a  van  der  Pauw  structure  on  the  sample.  The 
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contacts  were  then  annealed  at  925  °C  in  an  4%  N2  in  H2  forming  gas.  Ohmic  contact  was 
observed  via  the  measurement  of  the  current-voltage  (I-V)  behavior  between  the  contacts.  A 
similar  process  was  used  to  form  van  der  Pauw  contacts  on  sample  MSU-98-4H-57,  69a  and  70a. 
To  prepare  Schottky  contacts  for  Admittance  Spectroscopy,  A1  contacts  were  sputtered  and 
patterned  using  photolithography. 

Except  for  sample  MSU-98-4H-53,  all  other  samples  (MSU-98-4H-56,  57,  69,  70)  did  not 
contain  a  degenerate  layer  for  ohmic-like  contact  synthesis,  but  rather  relied  upon  deposition  of 
the  proper  metalization  to  achieve  ohmic  contacts.  Metal  deposition  was  performed  at  AFRL  on 
these  samples  by  Gerry  Landis  of  University  of  Dayton  Research  Institute  (UDRI)  and  consisted 
of  the  metal  layers  listed  in  Table  2. 


Table  2  Ohmic  Contacts  on  n-  and  p-type  SiC 


Material 

Metal  1 

Metal  2 

Metals 

Metal  4 

Anneal 

n-type 

A1 

Ni 

W 

Au 

925  °C* 

p-type 

NiCr* 

Cr 

Au 

— 

925  °C* 

*  Anneal  performed  in  4%  N2  in  H2  forming  gas  for  10  minutes 

*  Anneal  performed  after  NiCr  deposition 


3.  Epitaxial  Layer  Characterization 

It  is  well  known  that  accurate  characterization  assessment  of  epitaxial  layer  properties  requires 
that  a  multitude  of  characterization  techniques  be  employed.  The  Air  Force  Research 
Laboratories  (AFRL)  has  considerable  characterization  resources,  specifically  intended  to  assess 
the  material  properties  of  wide  band  gap  (WBG)  semiconductors,  like  4H-SiC.  As  a 
consequence,  several  characterization  experiments  were  performed  during  the  SFRP  visit  to 
AFRL.  These  were  Cathodoluminescence  (CL)  to  assess  optically  active  defects.  Hall  Effect  to 
measure  the  film  resistivity  and  carrier  mobility,  and  Admittance  Spectroscopy  to  determine  the 
Fermi  Level  (Ef)  position  in  the  band  gap.  Each  of  these  experiments  are  now  described  in  detail. 

Capacitance- Voltage  (CV) 

Using  standard  Capacitance- Voltage  (CV)  methods,  the  doping  density  and  carrier  type  of  each 
epitaxial  layer  were  measured.  The  results  are  listed  in  Table  1  for  reference.  For  each  epitaxial 
layer  the  conductivity  was  determined  to  assist  with  subsequent  data  analysis.  In  addition,  the 
well-known  fact  that  SiC  dopants  are  not  “classically  shallow,”  (i.e.,  within  a  few  kT  of  the  band 
edges)  requires  the  assessment  of  the  conductivity  type  to  be  performed  at  elevated  temperatures. 
As  a  consequence,  C-V  profiles  were  measured  at  elevated  temperatures  to  ensure  an  accurate 
assessment  of  both  the  net  doping  density  as  well  as  the  conductivity  type. 
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Cathodoluminescence  (CL) 


Using  the  luminescence  system  (both  CL  and  photoluminescence,  PL)  of  the  Air  Force  Institute 
of  Technology  (AFIT),  a  4H-SiC  sample  containing  what  was  believed  to  be  a  semi-insulating 
(or  at  least  highly  compensated)  epitaxial  layer  was  characterized  by  the  PI  and  Dr.  J.  Scofield  of 
AFRL.  The  CL  system  is  capable  of  detecting  luminescence  from  centers  located  throughout  the 
4H-SiC  band  gap.^  One  defect  of  considerable  interest  is  the  boron-related  D-center  (Ea  =  Ev  + 
0.60±0.2  eV),  which  displays  a  characteristic  green  luminescence  in  4H-SiC  with  a  peak  energy 
of  approximately  2.3-2.4  eV.’  Observation  of  this  characteristic  defect  signature  would  indicate 
that  a  sufficient  D-center  concentration  was  present  in  the  material  which  is  the  critical  deep- 
level  required  to  achieve  close  compensation  in  these  films. 

A  second  center  that  is  expected  to  be  present  from  earlier  characterization  of  these  fQms  is  the 
aluminum  acceptor^  (Ea  =  Ev  +  0.15  eV).  A  classic  donor-acceptor  pair  (DAP)  recombination  in 
the  violet  portion  of  the  visible  spectrum  is  associated  with  this  recombination  event,  and  should 
be  clearly  observable  if  sufficient  aluminum  acceptors  are  present. 

Both  of  these  luminescent  signatures  were  observed  during  the  CL  experiments,  as  the  low- 
temperature  (9K)  experimental  data  of  Figure  1  indicates.  The  violet  transition  was  clearly 
visible  to  the  naked  eye  when  the  sample  was  cooled  to  less  than  50K,  otherwise  the  green  D- 
center  luminescence  was  dominant.  Comparison  of  this  data  to  the  established  literature  on  4H- 
SiC  indicates  that  these  spectra  are  indeed  the  nitrogen  to  aluminum  and  nitrogen  to  D-center 
DAP  recombination  signatures,  respectively.* 


Figure  1.  CL  spectra  from  epitaxial  layer  no.  MSU-98-4H-52.  Spectra  taken  at  9  K.  Note  the 
nitrogen  to  aluminum  and  nitrogen  to  D-center  DAP  signatures  at  -4,500  A  and  -5,000  A, 

respectively.  The  beam  current  was  80  pA 
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In  order  to  fiilly  characterize  the  observed  luminescence,  the  temperature  dependence  of  the 
various  peaks  must  be  measured  (see  Fig.  2).  We  see  that  the  violet  transition  indicative  of  the 
nitrogen  to  aluminum  DAP  recombination  quenches  for  temperatures  greater  than  about  50  K, 
and  the  nitrogen  to  D-center  DAP  dominates,  and  is  clearly  observable  for  temperatures  as  high 
as  room  temperature  (see  Fig.  2(a)).  In  addition,  we  took  a  finer  look  at  the  luminescence 
structure  near  the  band  edge  (-4,000  A)  to  see  if  any  exciton  peaks  were  observable  (see  Fig. 
2(b)).  We  did  not  observe  any  excitonic  transitions  in  this  material  which,  given  the  extra  doping 
of  this  material  as  compared  to  undoped  4H-SiC,^  was  not  a  surprise. 


Wavelength  (Angstroms) 


Wavelength  (Angstroms) 


Figure  2.  (a)  CL  spectra  taken  from  50K  to  300K,  showing  nitrogen  to  D  center  DAP 
recombination  peak.  Note  spectra  does  not  shift  with  temperature. 

(b)  Band-edge  CL  spectra  taken  at  8K. 


Admittance  Spectroscopy 

Admittance  Spectroscopy  is  one  technique  that  is  useful  for  determining  the  position  of  the 
Fermi  Energy  Level,  Ef,  in  the  semiconductor  bandgap.  As  is  the  case  for  all  spectroscopic 
methods,  the  semiconductor  conductivity  is  measured  while  a  control  parameter  is  varied,  such 
as  electrical  bias,  sample  temperature  or  incident  optical  radiation.  Using  the  extensive 
characterization  resources  available  within  the  Materials  Laboratory  and  staffed  by  the 
University  of  Dayton  Research  Institute  (UDRI),  Optical  Admittance  Spectroscopy  (OAS)^  and 
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Thermal  Admittance  Spectroscopy  (TAS)*"  measurements  were  performed  on  EMRL  epitaxial 
layers.  Samples  MSU-98-4H-52  and  MSU-98-4H-56  were  investigated  using  OAS  and  TAS,  the 
details  of  which  are  now  described. 

Optical  Admittance  Spectroscopy  ( OAS) 

Sample  MSU-98-52  was  investigated  using  OAS  and  the  results  compared  with  vanadium-doped 
Cree  substrate  material.  Since  the  Cree  material  is  semi-insulating  (SI),  this  comparison  should 
reveal  interesting  similarities  and/or  differences  between  EMRL  compensated  epi  and  V-doped 
SI  material  of  the  same  polytype.  Figure  3  shows  the  OAS  data  taken  during  the  SFRP  visit 
along  with  the  OAS  spectrum  for  the  Cree  V-doped  SI  substrate. 


Figure  3  OAS  spectrum  of  sample  MSU-98-4H-52  compared  with  Cree  V-doped  SI 

substrate. 

Inspection  of  Figure  4  reveals  the  following  differences  between  the  V-doped  Cree  material  and 
the  EMRL  epitaxial  layer:  Both  samples  contain  peaks  related  to  Ti,  as  well  as  band-edge 
transitions.  The  primary  difference  is  the  large  peak  centered  around  2.5  eV  that  is  only  present 
in  the  EMRL  material.  We  believe  this  peak  is  related  to  the  boron-related  D-center,  which  is 
located  approximately  2.5  eV  below  the  conduction-band  edge  (i.e.,  Ea  =  Ec  -  2.5  eV).  This 
result  is  consistent  with  the  CL  data  discussed  in  the  last  section,  where  the  green  luminescence 
was  associated  with  donor  to  acceptor  (DAP)  recombination  between  the  nitrogen  donor  and  D- 
center.  The  difference  with  respect  to  the  OAS  data  is  that  this  DAP  transition  cannot  be 
observed;  rather,  only  conduction  band  to  recombination  center  transitions  are  observable. 
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Thermal  Admittance  Spectroscopy  (TAS) 


Using  measurement  techniques  developed  at  AFRL  and  described  in  the  literature,^  thermal 
admittance  spectroscopy  (TAS)  measurements  were  performed  on  Sample  MSU-98-4H-52  at 
measurement  frequencies  of  15,  30,  45,  60  and  75  kHz.  A  broad  TAS  peak  was  observed  during 
preliminary  measurement  of  this  sample  in  a  low-temperature  cryostat  (sample  measurement 
conducted  from  50  K  to  380  K,  with  a  peak  determined  to  be  greater  than  380  K).  Due  to 
instabilities  in  the  device  properties,  TAS  peaks  could  only  be  recorded  for  the  lowest  three 
frequencies.  As  a  consequence,  an  accurate  determination  of  the  activation  energy  of  the  peak 
could  not  be  made;  however,  upper  and  lower  bounds  for  the  observed  level  could  be  estimated 
and  are  believed  to  be  between  50  and  120  meV,  which  is  consistent  with  the  known  nitrogen 
donor  in  n-type  4H-SiC. 


Hall  Effect 

A  sample,  identical  to  MSU-98-4H-52  with  the  addition  of  a  n^  cap  layer,  was  grown  by  EMRL 
for  the  specific  intention  of  providing  a  suitable  device  structure  for  Hall  Effect  measurement. 
This  sample,  MSU-98-4H-53,  was  prepared  for  Hall  measurement  using  the  RIE  sample 
preparation  procedure  outlined  earlier.  Unfortunately,  this  sample  proved  to  be  slightly  p-type, 
and  thus  pn  junctions  were  formed  under  the  RIE  patterned  Ni  contacts.  As  a  consequence.  Hall 
measurement  of  this  sample  was  not  possible  using  the  Ni  contacts. 

Four  additional  epitaxial  layers  were  grown  and  these  samples  delivered  to  AFRL  to  support  the 
SFRP  effort.  A  summary  of  the  Hall  samples  investigated  during  the  SFRP  has  been  provided  in 
Table  1.  Each  sample  will  now  be  discussed  in  detail. 

Hall  data  taken  during  the  SFRP  on  sample  MSU-98-4H-56  indicated  that  the  epitaxial  layer  is 
not  semi-insulating,  with  a  room-temperature  sheet  resistivity  of  8  x  10^  Q-cm/square.  Given  a 
epi  thickness  estimated  to  be  2  fim,  the  bulk  resistivity  of  the  epitaxial  film  is  thus  approximately 
160  Q-cm,  which  is  far  from  semi- insulating.  What  is  interesting  about  this  Hall  data  is  that  the 
geometrical  ratio  between  the  contacts  is  strongly  temperature  dependent,  which  is  an  indication 
of  highly  anisotropic  behavior.  In  addition,  the  epitaxial  layer  concentration  versus  temperature 
profile  indicates  that  hopping  conduction  may  be  occurring  in  this  sample,  which  is  typically  the 
case  for  highly-doped  material."  Yet  the  net  doping  concentration,  as  determined  by  CV,  is  in 
the  low  10*^  cm'^  range.  It  may  therefore  be  the  case  that  close  compensation  is  responsible  for 
this  peculiar  result.  High  temperature  Hall  measurement  should  reveal  the  presence  of  hopping 
conduction  as  a  mode  of  carrier  transport,  which  would  be  manifested  in  a  double-slope  behavior 
evident  in  the  resistivity  versus  reciprocal  temperature  profile. 

Given  the  relatively  low  resistivity  of  sample  MSU-98-4H-56,  another  sample  was  prepared  with 
an  attempt  to  push  the  sample  into  the  semi-insulating  doping  regime.  This  was  accomplished  in 
two  ways.  First,  the  ratio  of  silicon-to-carbon  (Si/C)  was  increased  from  Si/C  =  0.18  to  Si/C  = 
0.2.  Second,  in  an  effort  to  reduce  the  incorporation  of  nitrogen  from  the  BN  solid-phase  doping 
source,  the  BN  doping  source  was  placed  0.5  mm  farther  away  from  the  graphite  susceptor  to 
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decrease  the  N  flux  in  the  reactor  during  the  growth  run.^  Hg-probe  C-I-V  indicated  that  the 
stated  objectives  of  these  two  epitaxial  layer  growth  process  changes  did  indeed  result  in  a  film 
that  is  likely  highly  compensated.  Van  der  Pauw  contacts  were  deposited  as  described  earlier, 
and  Hall  measurements  performed.  Figure  4  shows  representative  set  of  Hall  data  from  this 
sample  for  reference. 


2.0  2.5  3.0  3.5  4.0  4.5  5.0  5.5  6.0  6.5  2.0  2.5  3.0  3.5  4.0  4.5  5.0  5.5  6.0  6.5 

lOOO/TCK')  1000/T(K') 


Figure  4  Low-temperature  Hall  data  from  sample  MSU-98-4H-57  showing  evidence  of  hopping 
conduction.  Note  the  sample  has  a  room  temperature  resistivity  of  -160  Q-cm.  The  scatter  in  the 


mobility  data  does  not  permit  the  concentration  to  be  estimated. 
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Since  neither  of  these  layers  displayed  semi- insulating  performance,  a  second  set  of  Hall  samples 
were  prepared,  whose  parameters  are  listed  in  Table  1.  For  these  last  two  samples,  high- 
resistivity  material  was  again  not  observed,  as  shown  in  Figure  5.  However,  the  sample  did  have 
a  measurable  resistivity,  mobility  and  concentration,  although  there  is  concern  that  the 
interpretation  of  these  results  is  suspect  due  to  the  nature  of  the  sample  (thin  film  on  SI 
substrate).  The  primary  difference  is  that  the  material  displayed  n-type  conductivity  whereas  the 
epi  of  Fig.  4  was  p-type.  This  subtle  difference  could  be  significant,  as  will  be  described  in  the 
next  section  of  this  report. 
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Figure  5  Hall  data  from  sample  MSU-98-4H-69a,  showing  a  room  temperature  mobility, 
resistivity  and  concentration  of  ~  50  cmVV-s,  4  Q-cm  and  2  x  10*^  cm‘^. 
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4.  Discussion 


The  experimental  data  gathered  during  the  SFRP  tour  of  duty  at  AFRL  at  first  appears  to  be 
inconsistent,  but  after  looking  at  aU  of  the  data  the  outcome  is  quite  encouraging.  For  material 
that  was  somewhat  p-type  the  capacitance  was  very  low  (1-2  pF  as  determined  by  Hg-probe  CV), 
while  for  the  material  that  was  slightly  n-type  the  capacitance  was  even  lower  (see  Appendix  A 
for  C-I-V  data  on  samples  57  and  69a).  However,  in  both  cases  the  BN-doped  epi  films  appeared 
to  be  fully  depleted,  which  is  either  a  sign  of  very  lightly  doped  (net  doping  density  of  less  than 
10*'*  cm"^)  or  compensated  material. 

For  the  cases  of  slightly  n-type  material,  the  D-center  was  observed  via  cathodoluminescence 
(CL)  indicating  that  not  only  is  there  boron  incorporated  in  these  films  (this  had  been  established 
earlier  via  SIMS  characterization’)  but  that  an  observable  quantity  of  boron  is  forming  the  deep- 
level  needed  to  compensate  these  films  based  on  the  EMRL  technique:  the  D-center. 

The  Hall  Data  is,  in  some  ways,  supportive  of  these  results.  First,  for  slightly  n-type  material  firee 
carrier  mobility  and  concentrations  were  observed,  indicating  that  transport  was  primarily  due  to 
free-carrier  motion  and  not  limited  by  carrier  trapping  and  detrapping.  In  the  case  of  slightly  p- 
type  material,  evidence  of  Hopping  Conduction,  which  is  conduction  mediated  by  traps,  was 
observed.  This  is  fully  consistent  with  the  model  of  the  D-center  serving  as  an  acceptor  defect 
whereby  the  material  would  be  compensated  on  the  p-side  of  the  type  conversion  boundary. 

The  data  that  is  perhaps  the  most  interesting  is  the  admittance  data,  and  in  particular  the  optical 
admittance  spectroscopy  (OAS)  results.  Here  we  see  little  similarity  to  the  OAS  spectra  observed 
from  Cree  V-doped  SI  substrates,  in  particular  the  V-lines  are  absent.  What  is  common  to  both 
spectra  is  the  observation  of  Ti  peaks  and  a  peak  that  appears  to  be  originating  from  the  D-center. 
At  the  time  of  this  report  Dr.  Steve  Smith  of  UDRI  is  continuing  to  analyze  this  data  and 
compare  it  with  other  known  samples  to  develop  a  more  detailed  analysis  of  these  results. 

What  is  clear  at  this  juncture  is  the  need  to  continue  to  both  refine  the  BN-doping  scheme  and  to 
continue  to  perform  both  growth  and  characterization  experiments  to  fill  in  the  missing  data 
necessary  to  interpret  these  results.  The  most  obvious  result  of  the  Hall  experiments  is  the  need 
to  ensure  that  this  method  is  even  suitable  for  analysis  of  such  thin  films  on  insulating  substrates. 
The  obvious  question  in  this  regard  is  “what  is  the  actual  epi  volume  that  charge  conducts 
through”  since  the  estimate  of  this  volume  greatly  affects  the  data  interpretation.  Clearly 
experiments  need  to  be  performed  on  non-compensated  (i.e.,  non-BN-doped)  films  on  the  same 
substrate  material.  If  classic  Hall  results  are  achieved  for  this  very  simple  situation,  then  perhaps 
more  attention  can  be  paid  to  the  results  gathered  during  this  SFRP  on  BN-doped  epi  layers. 

The  most  important  outcome  of  the  SFRP  visit  to  AFRL  is  the  solid  working  relationship  that  has 
been  formed  between  the  EMRL  and  AFRL  SiC  researchers.  Although  the  principal  investigator 
has  been  familiar  with  the  AFRL  SiC  group  for  some  time,  and  a  modest  collaboration  had  been 
started  prior  to  the  undertaking  of  this  program,  there  is  no  substitute  for  working  together  to 
formulate  new  ideas  and  concepts.  This  is,  in  the  opinion  of  the  author,  the  most  significant 
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outcome  of  the  summer  visit  and  one  that  will  hopefully  lead  to  an  ever  expanding  collaboration 
in  the  future. 


5.  Future  Work 

Based  on  the  Experimental  Discussion  of  the  last  section,  it  is  viewed  by  the  PI  that  two  primary 
tasks  should  be  undertaken  to  continue  this  collaborative  effort.  The  first  task  is  simply  a 
continuation  of  the  experimental  investigation  undertaken  during  the  SFRP  tour  of  duty.  Indeed, 
at  the  time  of  this  report  writing,  the  PI  has  already  delivered  to  AFRL  a  n-type,  conventionally- 
doped  epi  layer  on  a  Cree  SI  substrate  for  hall  investigation.  As  mentioned  in  the  last  section,  if 
classic  Hall  data  results  from  this  sample,  then  it  may  become  possible  to  interpret  the  results  of 
Hall  experiments  conducted  on  EMRL  BN-doped  epi  layers.  In  addition,  AFRL  has  continued  to 
gather  data  after  the  PI  returned  to  MSU,  and  this  data  continues  to  be  analyzed  by  both  groups. 
In  addition,  TAS  and  OAS  experiments  have  recently  begun  at  AFRL  on  the  latest  samples 
grown  during  the  SFRP  visit.  These  samples  will  then  be  shipped  to  EMRL  where  DLTS 
experiments  wftl  be  performed  to  both  confirm  the  presence  of  the  D-center  in  this  material  and 
to  determine  its  concentration.  Knowing  the  D-center  concentration  as  a  function  of  growth 
parameters  is  critical  to  achieving  SI  epi  since  the  concentration  of  donors,  acceptors  and  deep- 
level  impurities  must  be  in  balance  for  close  compensation  to  occur. 

The  second  task  was  jointly  discussed  between  Dr.  WUliam  Mitchel  and  the  PI  extensively 
during  the  visit  and  is  really  two  subtasks,  both  having  to  do  with  EMRL  performing  epi 
experiments  to  support  AFRL  efforts  to  improve  SiC  substrates  and  ion  implants.  Considerable 
work  has  already  been  conducted  by  groups  either  working  at  AFRL  or  under  contract  to 
AFRL*^  in  the  area  of  chemical  mechanical  polishing  (CMP).  The  ultimate  proof  of  improved 
substrate  material  is  the  performance  of  device  layers  on  these  substrates.  EMRL  has  agreed  to 
assist  AFRL  in  this  fashion  by  growing  epi  layers  on  both  CMP  and  etched  substrate  material 
and  delivering  this  material  to  AFRL  for  device  fab  and  analysis.  EMRL  wiU  also  work  on  its 
own  CMP  methods  and  will  provide  identical  material  to  AFRL  for  comparison.  The  second 
subtask  involves  the  anneal  of  ion  implants  and  is  a  collaboration  between  AFRL,  Purdue 
University  (Dr.  M.  Capano)  and  EMRL.  EMRL,  using  its  silane-based  CVD  reactor,  will 
perform  annealing  experiments  in  an  attempt  to  develop  a  method  of  fully  activating  both  A1  and 
N  implants  while  maintaining  surface  morphology.  Indeed,  EMRL  has  already  received  several 
implanted  samples  from  Dr.  Capano  to  begin  this  effort  and  has  ordered  a  dedicated  quartz  tube 
to  serve  as  an  annealing  furnace. 

Both  of  these  activities  are  the  subject  of  a  proposal  to  AFOSR  under  the  Summer  Faculty 
Research  Extension  Program  (SFREP)  which  will  be  submitted  in  the  near  future. 
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Appendix  A 


Capacitance-Voltage  (CV)  and  Current- Voltage  (IV)  data  from  samples  MSU-98-4H-57  and 
MSU-98-4H-69a,  taken  with  a  Hg-probe  by  the  Emerging  Materials  Research  Laboratory 
(EMRL)  to  support  the  characterization  effort  undertaken  at  AFRL  under  the  SERF.  Note  that 
sample  57  is  slightly  p-type  and  has  a  zero  bias  capacitance  of  approx.  1.5  pF,  whereas  sample 
69a  is  slightly  n-type  with  a  capacitance  of  less  than  1  pF. 
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EMRL  sample  S>§-^H-57 


EMRL  sample  ^§-^H-jS9a 
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COMPUTER  MODELLING  OF  NONLINEAR  VISCOUS  PANEL  FLUTTER 


R.  Panneer  Selvam 
Associate  Professor 
Department  of  Civil  Engineering 
University  of  Arkansas 

Abstract 

Nonlinear  viscous  panel  flutter  is  reported  for  the  first  time  in  Selvam ,  Visbal  and 
Morton  [1].  Further  study  on  Euler  and  viseous  panel  flutter  is  reported  in  this  work. 
Divergenee  oeeurred  when  the  Mach  number  M  is  less  than  one.  There  are  two 
equilibrium  positions  during  divergence  depending  upon  the  initial  perturbations  is 
reported  in  [3]  and  similar  two  equilibrium  positions  are  simulated  in  this  work  for  both 
Euler  and  viscous  flows.  The  viseous  flow  features  are  much  more  complex  than  Euler 
flow.  Many  small  vortices  are  observed  during  viseous  negative  divergenee.  The  flow 
details  and  pressure  eontours  are  reported.  Computer  model  to  study  the  three- 
dimensional  panel  flutter  is  implemented.  At  this  time  the  nonlinear  plate  equation  is 
solved  by  finite  difference  method  in  spaee  and  Newmark-P  method  in  time.  Verification 
is  underway. 
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COMPUTER  MODELLING  OF  NONLINEAR  VISCOUS  PANEL  FLUTTER 


R.  Panneer  Selvam 


1.  Introduction 

In  the  previous  work  [1]  Selvam  et.  al.  has  shown  the  importance  of  coupled  solution 
procedure  to  that  of  lagged  approach  in  the  aeroelastic  or  fluid-structure  problems.  By  the 
coupled  approach  one  could  achieve  second  order  accuracy  by  using  fluid  and  structure 
solvers  of  second  order.  The  computed  stability  boundary  for  flutter  using  Euler  flow  for 
two-dimensional  panel  is  in  excellent  agreement  with  Dowell  [2]  and  Bendiksen  &  Davis 
[3].  Stability  boundary  for  viscous  flow  at  Re=l.xlO®  is  also  reported.  This  is  the  first 
time  one  could  simulate  the  panel  flutter  for  viscous  flow.  In  this  work  further 
investigation  is  done  for  2D  panel  flutter  to  understand  the  viscous  effect  and  the  results 
are  reported.  Attempt  is  made  to  model  the  3D  panel  flutter  using  the  fluid  code  reported 
in  [4]  and  by  developing  a  dynamic  nonlinear  plate  program  using  finite  difference 
method. 

2.  Method  of  Solution 

The  flow  equations  are  solved  by  Beam-Warming,  alternate-direction,  implicit  scheme. 
The  structural  equations  are  solved  by  the  implicit  Newmark-P  method  as  discussed  in 
[1].  The  equations  are  approximated  in  space  by  finite  difference  method.  The  details  of 
the  2D  equations  are  given  in  [1]. 

2. 1.  Nonlinear  Dynamic  Analysis  of  Plate 

For  3D  also  the  flow  equations  are  solved  similar  to  2D  as  given  in  [4].  The  three 
dimensional  (3D)  plate  equations  can  be  solved  by  either  finite  element  method  (FEM)  or 
finite  difference  method  (FDM)  for  simple  regions.  For  complex  regions  like  cantilever 
plate  or  wing  finite  element  method  is  preferred.  In  this  work  as  a  start,  rectangular  panels 
are  considered  and  hence  FDM  is  used.  The  governing  equations  for  the  nonlinear 
dynamic  analysis  of  plate  as  given  in  [5-8]  is  as  follows: 

DV‘'w=  p  +  Nx  w,xx  +  Ny  w,yy  +  2Nxy  w,xy  -  m  w,tt  (1) 

Nx,x  +  Nxy,y=0  (2) 

Nxy,x  +  Ny,y=0  (3) 

Where  V''=  w,xxxx+2w,xxyy+w,yyyy. 
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Nx=  B[u,x+(w,x)^/2+o(v,y-i-(w,y)^/^)] 

Ny=  B[v,y+(w,y)V2+o(u,x+(w,x)V2)] 

Nxy  =  B((l-u)/2)[u,y+v,x+w,x+w,y] 

D=EhV(12(l-u'))and  B=Eh/(l-u') 

Here  u,v  and  w  are  the  displacements  in  the  x,  y  and  z  directions,  Nx  and  Ny  are  the 
stresses  in  the  x  and  y  direction ,  Nxy  is  the  shear  stresses,  E  is  the  modulus  of  elasticity, 
h  is  the  thickness  of  the  plate ,  o  is  the  Poisson’s  ratio,  p  is  the  pressure  and  m  is  the  mass  per 
unit  area.  Here  the  terms  following  a  comma  represents  the  differentiation  with  respect  to  that 
variable.  Here  it  is  assumed  that  the  inertial  effect  in  the  axial  direction  is  very  small  and  hence  it 
is  neglected. 

The  equations  are  highly  nonlinear  and  also  the  equation  in  the  z  direction  is  fourth  order.  The 
solution  procedure  used  by  Vallabhan  and  Selvam  [9]  is  used  in  this  work  for  equation  1 .  The 
FDM  stencils  for  equal  spacing  is  given  in  [5].  At  this  time  the  plate  is  assumed  to  be  simply 
supported  at  all  edges  and  the  displacements  u  and  v  along  the  edges  are  also  assumed  to  be  zero. 
The  computed  displacement  for  static  problem  reported  in  [7]  is  in  excellent  agreement  with 
FEM  solution  in  [7].  Then  the  program  is  modified  to  incorporate  the  Newmark-P  method  for 
dynamic  problems.  Currently  it  is  in  the  process  of  verifying  the  fluid-structure  interaction  code. 
The  plate  equations  are  rewritten  in  a  non-dimensional  form  as  follows: 

(p/X,)  V'*  w  =  p  (p  +  Nx  w,xx  +  Ny  w,yy  +  2Nxy  w,xy)  -  w,tt  (1) 

Nx,x  +  Nxy,y=0  (2) 

Nxy,x  +  Ny,y  =0  (3) 

Where  V''=  w,xxxx+2w,xxyy+w,yyyy, 

Nx=  tl  [u,x+(w,x)^/2+u(v,y+(w,y)^/2)] 

Ny=  tl  [v,y+(w,y)^/2+u(u,x+(w,x)^^)] 

Nxy  =  tl((l-u)/2)[u,y+v,x+w,x+w,y] 

p  =  pj/(psh),  X=  pocU^^P/D  and  tl=B/(p«u^H ) 


Here  u,  v  and  w  are  non-dimensional  displacements  with  respect  to  the  length  of  the  plate  1, 
p  is  the  mass  ratio,  X  is  the  dynamic  pressure  and  tl  is  the  non-dimensional  term  for  the 
membrane  forces.  Further  work  is  underway  to  solve  the  plate  equations  using  FEM  similar 
to  Xue  and  Mei  [10]. 
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3.  Results 


3.1  Divergence  of  2D  Panels 

Panel  response  in  the  stable  and  flutter  regions  are  investigated  at  M=1 .2  for  Euler  and  viscous 
flows  in  Selvam  et.  al.  [1] .  When  M  is  less  than  1  and  X  is  greater  than  critical  X,  rather  than 
flutter,  divergence  occurs.  Davis  and  Bendiksen  [3]  found  that  for  Euler  flow  there  are  two 
equilibrium  positions  depending  upon  the  initial  perturbation.  In  this  work  also  it  is  observed 
that  if  the  initial  velocity  is  positive  (negative)  then  the  divergence  is  in  the  positive  (negative) 
direction  as  shown  in  Fig.  1  and  2.  For  the  illustration  p,  =0. 1  and  X  =200  is  considered.  The 
plate  is  divided  into  50  equal  spacing  and  the  time  step  is  considered  to  be  0.05.  The  nonlinear 
coefficient  tl  for  2D  used  in  [1]  is  125.  To  study  the  transonic  features  in  detail  tl  is  considered 
to  be  25.  When  the  deflection  is  around  0.021,  shock  is  noticed  at  the  front  and  back  of  the  plate 
for  negative  divergence  as  shown  in  Fig.  3.  For  positive  divergence  the  shock  appears  at  the  end 
of  the  plate  the  Mach  number  ranges  from  0.7  to  1 .2. 

In  the  case  of  viscous  flows  also  divergence  occurred  similar  to  Euler  flow.  But  the  flow  features 
are  much  complicated  as  shown  in  Fig.  5  to  Fig.  1 0.  For  negative  divergence,  rather  than 
coming  to  a  stand  sill  position  as  in  Euler  flow,  the  displacements  at  the  midpoint  and  quarter 
point  of  the  plate  are  oscillating  as  in  Fig.  5.  Even  when  the  time  step  is  reduced  to  0.01  and  ran 
for  longer  time  this  oscillation  is  noticed.  For  positive  divergence,  the  oscillation  is  very  small  as 
in  Fig.  6.  When  small  time  step  is  used  and  ran  for  longer  time  the  oscillating  amplitude 
approached  close  to  zero.  These  oscillations  may  be  due  to  the  vortices  created  close  to  the  plate 
as  shown  in  Fig.  10.  The  effect  is  more  for  negative  divergence  because  many  vortices  are 
created  as  in  Fig.  10.  The  pressure  coefficients  and  the  deflected  shape  of  the  plate  is  plotted  in 
Fig.  7  and  8  for  negative  and  positive  divergence.  The  pressure  changes  sign  more  th^  once  on 
the  plate  because  of  the  complex  flow  over  the  plate  during  viscous  effect.  Experiments  were 
conducted  by  computing  flow  over  cavity  and  considering  the  stmcture  to  be  rigid.  Then  these 
oscillations  were  not  noticed  as  much.  Hence  this  may  be  essentially  due  to  the  fluid-stmcture 
interaction.  The  pressure  contours  are  plotted  in  Fig.  9  for  both  cases.  For  the  negative 
divergence  pressure  waves  are  noticed.  This  may  lead  to  aero-acoustic  problems.  These  pressure 
waves  disappeared  when  the  flow  is  computed  considering  the  plate  to  be  rigid  after  it  diverged. 
Hence  it  is  clearly  due  to  fluid-stmcture  interaction. 

3.2  Three-dimensional  panel  flutter 

The  nonlinear  dynamic  analysis  program  is  verified  and  combined  with  fiow  code. 
Currently  the  fluid  stmcture  code  is  verified.  Further  work  is  under  way  to  implement 
FEM  procedure  to  solve  the  response  of  the  plate. 


4.  Conclusions 

Nonlinear  viscous  panel  flutter  is  reported  for  the  first  time  in  Selvam  et.  al.  [1].  Further 
study  on  Euler  and  viscous  panel  flutter  is  reported  in  this  work.  Divergence  occurred 
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when  the  Mach  number  M  is  less  than  one.  There  are  two  equilibrium  positions  during 
divergence  depending  upon  the  initial  perturbations  is  reported  in  [3]  and  similar  two 
equilibrium  positions  are  simulated  in  this  work  for  both  Euler  and  viscous  flows.  The 
viscous  flow  features  are  much  more  complex  than  Euler  flow.  Many  small  vortices  are 
observed  during  viscous  negative  divergence.  The  flow  details  and  pressure  contours  are 
reported.  Computer  model  to  study  the  three-dimensional  panel  flutter  is  implemented.  At 
this  time  the  nonlinear  plate  equation  is  solved  by  finite  difference  method  in  space  and 
Newmark-P  method  in  time.  Verification  is  underway. 
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Fig.l.  Panel  response  during  transonic  divergence  in  negetive  direction  for  M=0.9.  Using 
Euler  floNV.  Simply  supported  panel,  |j.=.l  and  X=200. 


Fig.  2.  Panel  response  during  transonic  divergence  in  positive  direction  for  M=0.9.  Using 
Euler  flow.  Simply  supported  panel,  |x=.l  and  A,=200. 
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Fig.  3.  Panel  surface  Mach  number  and  deflected  shape  at  negetive  divergence  for 
M=0.9.  Using  Euler  flow.  Simply  supported  panel,  p=.l  and  X,=200. 


Fig.  4.  Panel  surface  Mach  number  and  deflected  shape  at  positive  divergence  for  M=0 
Using  Euler  flow.  Simply  supported  panel,  p=.  1  and  X=200. 
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quarter-point 


Fig.5.  Panel  response  during  transonic  divergence  in  negetive  direction  for  M==0.9.  Using 
viscous  flow.  Simply  supported  panel,  fi=.l  and  A.=200. 


- quarter-point 

-  midpoint 


Fig.  6.  Panel  response  during  transonic  divergence  in  positive  direction  for  M=0.9.  Using 
viscous  flow.  Simply  supported  panel,  |li=.1  and  ?i=200. 


Fig.  7.  Panel  surface  pressure  coefficient  and  deflected  shape  at  negetive  divergence  for 
M=0.9.  Using  viscous  flow.  Simply  supported  panel,  |a,=.l  and  A,=200. 


Fig.  8.  Panel  surface  pressure  coefficient  and  deflected  shape  at  positive  divergence  for 
M=0.9.  Using  viscous  flow.  Simply  supported  panel,  p=.l  and  X.=200. 
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Fig.  9.  Pressure  contours  for  negetive  and  positive  divergence  for  M=0.9.  Using  viscous 
flow.  Simply  supported  panel,  |x=.  1  and  A,=200. 
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Fig.  10.  Vorticity  contours  for  negetive  and  positive  divergence  for  M= 
flow.  Simply  supported  panel,  |j,=.l  and  X,=200. 


=0.9.  Using  viscous 
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ABSTRACT 

The  cyclic  passages  of  wakes  from  upstream  engine  components  introduce  severe  fluctuations  into  the 
aerothermal  environment  of  turbine  blades.  The  associated  adverse  effects  are  amplified  at  higher  flight  altitudes,  as 
the  attendant  reduction  in  Reynolds  number  results  in  unsteady  flow  separation  on  the  blades  and  a  corresponding 
decrease  in  engine  performance.  In  the  present  study,  computations  simulate  the  imsteady  interactions  of  small  cylinder 
wakes  with  a  cascade  of  Langston  turbine  blades  in  a  sequence  of  Reynolds  number  cases  (660  K  -  16.5  K).  Vortex 
shedding  from  the  cylinders  produces  an  unsteady  environment  for  the  downstream  boundary  layers  that  may  have 
adjustable  content  in  fiequency  and  length  scale.  Results  from  the  existing  VBI2D  code  are  presented  concerning  the 
flowfield,  pressure  distributions,  and  total  pressure  loss  coefficient  Flow  structures  are  seen  that  are  rich  in  oscillations 
and  separations,  especially  at  the  lower  Reynolds  numbers.  A  somewhat  surprising  structure  is  seen  in  the  generation 
of  total  pressure,  which  indicates  the  need  for  furtfier  study.  A  provisional  explanation  is  offered  for  the  corresponding 
behavior  in  pressure  loss  coefficient  by  means  of  a  basic  analytical  model. 
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A  Computational  Study  of  Turbine  Blade  Interactions 
with  Cylinder  Wakes  at  Various  Reynolds  Numbers 

Paavo  Sepri 
INTRODUCTION 

Objectives 

One  objective  of  the  present  study  has  been  to  provide  further  computational  characterizations  of  two- 
dimensional  unsteady  flow  in  turbine  stages  that  are  progressively  influenced  by  flow  separations  associated  with 
decreasing  Reynolds  number.  As  modem  aircraft  are  designed  to  operate  in  an  ever  increasing  range  of  altitudes,  with 
the  goal  of  not  compromking  engine  performance,  it  becomes  challenging  to  design  and  match  components  that  remain 
efficient  under  varying  operating  conditions.  In  particular,  at  sea  level,  the  Reynolds  number  (based  on  axial  chord) 
of  flow  around  a  low  pressure  turbine  blade  is  approximately  500,000.  As  the  altitude  increases  to  50  Kft,  the  ambient 
density  decreases  by  an  order  of  magnitude,  resulting  in  a  corresponding  decrease  in  Reynolds  number.  This  change 
in  flow  condition  k  often  accompanied  by  a  dramatic  flow  separation  on  the  suction  side  of  the  blade,  which  results  in 
a  large  loss  in  turbine  stage  performance. 

Another  issue  which  influences  turbine  blade  performance  is  related  to  the  unsteadiness  inherent  in  the  flow 
around  tire  blade,  which  k  composed  of  a  high  level  of  turbulence  as  well  as  the  periodic  fluctuations  produced  by  wake 
passages  fi'om  stator  vanes  upstream  of  the  turbine  blades.  These  Im'ge  fluctuations  cause  increased  heat  transfer  rates 
to  the  turbine  blades,  and  also  produce  variations  in  the  location  and  extent  of  transitional  boundary  layers.  Furthermore, 
at  lower  Reynolds  numbers,  boundary  layers  may  re-laminarize  in  regions  of  favorable  pressure  gradients.  In  tire  present 
study,  die  effects  of  wake  passages  have  been  introduced  by  placing  a  row  of  cylinders  upstream  of  the  blades.  Vortex 
shedding  from  die  cylinders  provides  an  influence  on  the  turbine  blade  boundary  layers  that  may  be  varied  both  in  scale 
size  and  frequency  by  adjusting  the  cylinder  diameter  and  flow  rate.  The  objective  here  has  been  to  utilize  an  existing 
computer  code,  termed  Vane-Blade  Interaction  [1,2],  to  provide  two-dimensional  unsteady  simulations  of  these  effects. 

Background 

The  Vane-Blade  Interaction  (VBI)  code  has  been  developed  by  several  investigators  [1-4]  at  the  Allison  Engine 
Company  under  contract  to  the  United  States  Air  Force.  The  earlier  two-dimensional  version  (VBI2D)  has  recently  been 
extended  to  include  3-D  unsteady  computational  capability,  although  this  latter  version  is  still  under  evaluation.  In  a 
previous  study,  Rivir  et  al.  [5]  utilized  the  VBI2D  code  to  calculate  flow  fields  for  Langston  [6,7]  turbine  blade  cascades 
in  the  steady  state  mode  and  without  wake  effects  for  a  variety  of  Reynolds  numbers  and  cascade  solidities.  The  code 
incorporates  the  Baldwin-Lomax  [8]  two-layer  algebraic  turbulence  model.  Among  other  results,  it  was  observed  that 
the  Langston  cascade  (at  the  original  pitch  to  chord  spacing  of  0.93)  demonstrated  separation  at  Re=50K,  oscillating 
transition  at  lOOK,  and  attached  flow  for  higher  Reynolds  numbers. 
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Experimental  investigations  of  flow  in  turbine  passages  have  recently  been  reported  by  Blair  [9],  Baughn  et 
al.  [10],  and  Murawski  et  al.  [11].  Blair  has  noted  the  several  regions  of  high  heat  transfer  produced  by  three- 
dimensional  flow  effects  within  die  rotor  passage,  and  has  measured  the  effects  of  wall  roughness  on  heat  transfer. 
Furthermore,  he  discusses  the  sensitivity  to  changes  in  Reynolds  number  and  inlet  flow  angle.  Baughn  et  al.  present 
comparisons  of  linear  cascade  experiments  performed  at  UC  Davis  and  the  USAF  Academy.  The  cascade  geometries 
are  similar  to  those  of  earlier  tests  performed  by  Langston  et  al.  [6]  and  Graziani  et  al.  [12]  in  linear  cascades,  and  also 
to  those  of  Dring  et  al.  [13]  and  Blair  et  al.  [14]  in  rotating  tests.  In  the  Reynolds  number  range  of  67K  to  144K,  Baughn 
et  al.  note  that  an  elevated  level  of  ffee-stream  turbulence  influences  the  flow  in  augmenting  heat  transfer,  advancing 
the  location  of  transition,  and  removing  observed  flow  striations  from  the  blade  pressure  side  (presumably  caused  by 
GOrtler  vortices).  Furthermore,  at  the  lower  Reynolds  numbers,  evidence  is  presented  of  boundary  layer  separation  on 
the  blade  suction  side.  Murawski  et  al.  performed  experiments  in  the  linear,  2-D  airfoil  cascade  at  AFIT  at  Wright 
Patterson  Air  Force  Base.  The  Reynolds  number  (based  on  exit  velocity  and  suction  surface  length)  was  varied  in  the 
range  of  50K  to  300K,  and  the  level  of  free-stream  turbulence  (FST)  was  varied  in  the  range  of  1.1%  to  8.1  %. 
Separation  on  the  suction  surface  was  observed  for  all  Reynolds  numbers,  with  a  shrinkage  of  the  separation  zone  with 
increasing  Reynolds  number  and  increasing  turbulence  level.  Wake  surveys  with  a  hot  film  have  indicated  that  the 
width  of  the  wake  decreases  with  an  increase  of  FST,  and  that  it  also  decreases  slightly  with  increasing  Reynolds 
number.  These  trends  are  demonstrated  to  correlate  well  with  the  measured  decreases  in  pressure  loss  coefficient  as 
Reynolds  number  is  increased.  Calculations  using  the  VBI2D  code  also  indicate  this  trend  of  decreasing  loss  coefficient, 
although  the  effect  of  elevated  FST  is  not  included  in  the  calculations,  and  the  numerical  results  appear  to  underpredict 
the  experimental  values  in  some  cases. 

Among  the  many  other  studies  of  flow  in  turbine  blade  passages  are  those  of  Hodson  [15],  Zaccaria  and 
Lakshminarayana  [16],  and  Doorly  and  Oldfield  [17].  At  Cambridge  University,  Hodson  has  measured  velocities  and 
surface  pressure  distributions  by  means  of  instrumentation  fixed  to  the  rotor  system,  so  that  time-resolved  influences 
of  stator  wake  passages  could  be  directly  observed.  It  is  concluded  that  the  main  effects  of  unsteadiness  can  be 
associated  with  the  periodic  convection  of  the  wakes  through  the  blade  passage.  An  interpretation  of  the  origin  of  blade 
surface  velocity  fluctuations  is  offered,  although  it  is  also  stated  that  the  phase  variations  of  the  pressure  distributions 
along  the  blade  surfaces  are  not  understood.  At  the  Axial  Flow  Turbine  Research  Facility  at  Pennsylvania  State 
University,  Zaccaria  and  Lakshminarayana  have  utilized  a  two-dimensional  LDV  system  to  measure  both  time-resolved 
and  cycle-averaged  wake  flow  properties  in  a  rotor  passage.  Among  other  observations,  it  is  noted  that  wake 
interactions  with  the  rotor  leading  edge  cause  considerable  downstream  influences  as  the  wake  introduces  significant 
swings  in  the  effective  angle  of  attack.  Doorly  and  Oldfield  have  utilized  a  moving  cylindrical  wake  generator  upstream 
of  a  stationary  cascade  of  rotor  blades  at  the  Isentropic  Light  Piston  Tuimel  at  Oxford  University.  By  means  of 
Schlieren  photography,  heat  transfer  gauges,  and  surface  pressure  measurements,  they  show  that  the  passage  of  wakes 
and  shocks  initiate  leading  edge  separation  bubbles  on  the  turbine  blades.  The  bubbles  subsequently  convect  a  short 


28-4 


distance  downstream  and  then  collapse.  Unsteady  heat  transfer  measurements  indicate  that  a  collapsing  bubble  generates 
a  turbulent  boundary  layer  patch  which  also  is  swept  downstream,  causing  a  locally  elevated  rate  of  heat  transfer. 

A  related  topic  of  study  is  that  of  the  effects  of  elevated  levels  of  ffee-stream  turbulence,  which  are  generated 
by  components  upstream  of  the  turbine  stage.  Although  the  periodic  unsteadiness  caused  by  wake  passages  may 
dominate  downstream  flow  performance,  it  has  long  been  known  that  high  levels  of  FST  also  influence  heat  transfer, 
transition,  and  boundaiy  layer  development.  Recent  studies  aimed  at  increasing  the  level  of  generated  FST  have  utilized 
upstream  grids  that  include  active  blowing  through  holes  in  hollow  rods.  By  these  means,  Young  et  al.  [18]  have 
extended  heat  transfer  measurements  on  a  flat  plate  for  turbulence  intensities  ranging  up  to  25%.  Welsh  et  al.  [19]  have 
applied  the  same  technique  at  the  U.  S.  Air  Force  Academy  to  influence  flow  in  a  Langston  turbine  blade  cascade.  In 
both  cases,  higher  levels  of  FST  advance  the  location  of  transition  and  enhance  heat  transfer,  although  it  appears  that 
these  effects  may  reach  a  limiting  form  at  the  highest  levels  tested.  Funazaki  [20]  and  Funazaki  et  al.  [21]  have 
measured  the  effects  of  FST  and  wake  passages  on  heat  transfer  in  the  region  of  a  cylindrical  forebody  that  includes  film 
cooling  from  discrete  holes.  Information  concerning  film  cooling  effectiveness  was  obtained  through  liquid  crystal 
thermography,  which  demonstrates  that  elevated  levels  of  FST  and  increased  wake  Strouhal  numbers  both  have  adverse 
effects  on  film-cooling  effectiveness.  Fridman  [22]  has  recently  presented  an  algebraic  relaxation-length  model  of 
turbulence  to  simulate  FST  effects  in  flat  plate  turbulent  boundary  layers,  which  apparently  may  be  extended  to  turbine 
blade  flows. 

Computational  simulations  of  flow  in  turbine  blade  passages  have  included  two-dimensional  models,  and  more 
recently,  three-dimensional  models.  Abhari  [23]  has  simulated  rotor-stator  interactions  involving  both  steady  and 
unsteady  film  cooling  by  means  of  a  2-D  unsteady  code  that  couples  an  explicit  inviscid  algorithm  with  a  thin  shear  layer 
Navier-Stokes  implicit  scheme  solved  on  a  body-  fitted  boundary  layer  grid.  The  Baldwin-Lomax  turbulence  model  is 
used  to  specify  boundary  layer  transition  in  these  computations.  Comparisons  with  existing  experimental  results  indicate 
good  representations  of  film  cooling  on  the  surface  heat  flux,  except  in  regions  immediately  downstream  of  the  injection 
rows.  Large  scale  pulsations  of  static  pressure  in  the  flow  cause  pulsations  of  coolant  flow  from  the  film  holes,  causing 
choking  and  unchoking  in  some  cases.  Amone  and  Pacciani  [24]  have  developed  a  time-accurate  2-D  code  to  analyze 
rotor-stator  interactions  that  utilizes  a  fully  implicit  time  discretization  to  remove  stability  limitations.  Accelerating 
strategies  include  local  time  stepping,  residual  smoothing,  and  multigrid  computations.  The  code  has  been  applied  to 
a  highly  loaded  gas  turbine  stage,  for  various  combinations  of  stator-turbine  pitch  ratios. 

Three-dimensional  simulations  of  flow  in  engine  components  have  been  performed  by  Koya  and  Kotake  [25], 
Boyle  and  Jackson  [26],  and  Kang  and  Hirsch  [27].  Koya  and  Kotake  calculate  periodic,  transonic  flows  in  highly 
twisted  turbine  stages  using  a  finite-volume  integral  method  to  solve  the  Euler  equations.  Attention  has  been  given  to 
the  treatment  of  boundaiy  conditions  involving  periodicity  and  to  phase  relationships  between  time  dependent  stator 
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and  rotor  flow  flelds.  Boyle  and  Jackson  have  compared  the  results  of  two  different  steady-state  3-D  Navier-Stokes 
codes  applied  to  a  turbine  vane  with  end  walls,  with  the  objective  of  calculating  heat  transfer  and  pressure  distributions. 
Several  algebraic  models  for  turbulence  transition  have  been  compared  in  the  study,  and  the  Baldwin-Lomax  model  is 
among  these.  Heat  transfer  predictions  on  the  pressure  side  of  the  vane  have  agreed  well  with  experimental  data,  and 
a  transition  model  formulated  by  Chima  has  apparently  performed  the  best  on  the  suction  side.  Kang  and  Hirsch  have 
applied  a  3-D  time-dependent  Reynolds-averaged  Navier-Stokes  code  to  simulate  conditions  in  a  linear  compressor 
cascade  including  tip  clearance  with  respect  to  a  stationary  wall.  The  Baldwin-Lomax  turbulence  model  is  also  utilized. 
The  computations  appear  to  capture  the  development  of  secondary  flows  and  tip  leakage  flows.  However  the  approach 
to  separation,  leading  to  the  tip  leakage  vortex,  appears  to  be  underpredicted,  which  the  authors  speculate  to  result  from 
a  limitation  of  the  Baldwin-Lomax  turbulence  model. 

Lengthier  and  more  detailed  accounts  of  experimental  studies  of  flow  within  multi-stage  low  pressure  turbines 
have  been  given  in  the  dissertation  by  Halstead  [28]  at  Iowa  State  University  and  in  the  report  by  Domey  et  al.  [29]  at 
United  Technologies  Research  Center.  Halstead  has  taken  extensive  boundary  layer  measurements  near  both  pressure 
and  suction  surfaces  by  means  of  densely  packed  hot-film  sensors.  He  has  observed  regions  of  laminar,  transitional, 
and  turbulent  flows  that  are  influenced  by  the  passages  of  wakes  through  the  blade  rows.  At  low  Reynolds  numbers, 
Halstead  notes  that  the  suction  side  boundary  layer  remains  laminar  over  more  than  seventy  percent  of  the  surface. 
Domey  et  al.  have  utilized  the  numerical  scheme  developed  by  Rai  [30]  to  simulate  experiments  in  the  Large  Scale 
Rotating  Rig  at  UTRC.  Both  2-D  and  3-D  versions  have  been  implemented  to  study  the  migration  of  combustor  hot 
streaks  through  stator  and  rotor  stages.  The  2-D  code  indicates  that  the  blade  count  ratio  could  have  a  substantial  effect 
on  unsteady  flow  characteristics.  Results  from  the  3-D  code  agree  with  the  experimental  observation  that  hot  streaks 
tend  to  accumulate  on  the  pressure  side  of  rotor  blades,  thereby  creating  time-averaged  surface  hot  spots. 

Loss  mechanisms  in  turbomachines  are  described  in  some  detail  in  the  1993 IGTI  Scholar  Lecture  by  Denton 
[31].  He  describes  the  traditional  three  categories  of  profile  loss,  end-wall  loss,  and  leakage  loss.  In  the  present  study 
of  2-D  flow,  the  latter  two  categories  do  not  apply  directly.  Also  discussed  are  several  definitions  of  loss  coefficient, 
of  which  the  stagnation  pressure  loss  coefficient  for  the  turbine  blade  is  utilized  herein.  Denton  describes  the  connection 
of  the  various  loss  coefficients  to  the  increase  of  entropy  experienced  in  turbine  stages.  Of  relevance  to  the  pressure 
loss  suffered  through  flow  separation,  especially  at  low  Reynolds  numbers,  is  the  recent  computational  study  by  Muti 
Lin  and  Pauley  [32].  A  2-D  unsteady  incompressible  scheme  is  applied  to  flow  around  an  Eppler  387  airfoil  in  the 
Reynolds  number  range  60K  to  200K.  A  detailed  time  history  of  suction  side  separation,  vortex  development,  vortex 
pairing,  and  subsequent  convection  is  presented  for  several  angles  of  attack  of  the  airfoil.  The  mechanism  for  instability 
is  attributed  to  be  the  inviscid  Kelvin-Helmholtz  one,  with  2-D  large  scale  vortex  shedding  to  be  the  result. 
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RESEARCH  DESCRIPTION 


Computational  Methodology 

The  computer  code  used  for  the  present  study  has  been  developed  earlier  by  several  investigators  [1-4]  at  the 
Allison  Engine  Company  under  contract  to  the  United  States  Air  Force.  The  version  used  here  is  designated  VBI2D, 
as  it  calculates  vane-blade  flow  interactions  in  a  two-dimensional  unsteady  manner,  utilizing  a  Reynolds-averaged 
compressible  viscous  flow  model.  Specifics  of  the  computational  methodology  and  operating  procedures  are  described 
by  Rao  et  al.  [1,2].  Briefly,  an  0-grid  system  surrounds  each  vane  or  blade  element,  and  the  0-grid  is  embedded  into 
an  outer  H-grid,  which  is  contoured  to  conform  to  the  main  passages  between  the  elements  constituting  each  row  of 
vanes  or  blades.  The  computational  grid  used  for  the  present  study  is  shown  in  Fig,  1. 


Figure  1:  Cylinder/Langston  Blade  Geometry  and  Computational  Grids. 

The  geometry  is  taken  to  be  that  of  an  axisymmetric  body  of  revolution,  along  which  the  fluid  flows  tangent 
to  stream-surfaces.  Specific  vane  and  blade  geometries  are  defined  through  further  transformations,  while  the  flow  is 
restricted  to  lie  within  the  stream-surfaces,  thereby  eliminating  the  possibility  of  3-D  flow.  The  vane  and  blade  grid 
systems  may  be  stationary,  or  may  be  in  relative  rotation.  Interpolation  procedures  are  used  to  transfer  data  between 
the  overlapping  O-  and  H-  grids,  as  well  as  the  overlapping  vane  and  blade  H-grids.  The  numerical  method  utilizes  a 
five-stage  Runge-Kutta  scheme  with  implicit  residual  smoothing.  Turbulence  is  modeled  with  the  Baldwin-Lomax  [8] 
two-layer  algebraic  eddy  viscosity  model.  The  computational  procedure  has  consisted  of  an  initial  run  in  the  steady  state 
mode  to  set  up  an  approximate  flow  condition  (typically  2000  iterations  were  used),  followed  by  a  run  in  unsteady  mode 


28-7 


to  set  up  the  time  dependent  flow  (40,140  iterations),  and  finally  followed  by  an  unsteady  run  describing  one  periodic 
passage  of  wakes  with  respect  to  the  downstream  configuration.  During  this  last  period,  output  has  been  generated  at 
twenty  equal  time  intervals  for  each  Re  case,  in  order  to  resolve  flow  variations  within  each  cycle. 


Computational  Cases 


The  geometry  has  consisted  of  an  upstream  row  of  circular  cylinders  (R  =  0.1  in)  and  a  downstream  row  of 
Langston  turbine  blades  (axial  chord  4.0  in).  The  solidity  (ratio  of  pitch  to  axial  chord)  was  chosen  to  be  1.075  to 
conform  with  a  recent  experiment  [10]  at  the  Air  Force  Academy.  Each  spatial  period  consists  of  two  cylinders 
upstream  of  three  turbine  blades,  as  may  be  deduced  from  Fig.  1.  In  all  cases,  the  blade  row  was  held  stationary,  while 
the  upstream  cylinders  were  set  in  motion,  in  order  to  produce  the  passage  of  wakes.  The  upstream  flow  incidence,  with 
respect  to  the  horizontal,  was  chosen  to  be  45°,  close  to  the  conditions  of  previous  experiments.  The  axial  and 
circumferential  inlet  Mach  number  components  were  taken  to  be  0.2,  producing  a  total  Mach  number  of  0.2828.  The 
circumferential  Mach  number  of  the  cylinders  was  taken  to  be  0.1,  while  for  the  last  series  it  was  set  at  -0.1.  The 
upstream  stagnation  temperature  was  specified  to  be  520  °R.  In  order  to  simulate  the  change  in  Reynolds  number  (Re) 
that  occurs  with  increasing  altitude,  the  upstream  total  pressure  was  varied  in  seven  increments  from  a  maximum  of  14.7 
psia  to  a  minimum  of  0.3675  psia.  The  corresponding  change  in  density  results  in  a  variation  of  axial  chord  Re  from 
660K  to  16.5K,  as  indicated  in  Table  1.  The  beginning  of  a  temporal  period  may  be  chosen  arbitrarily  in  the  cycle. 
Here,  the  VBI2D  code  has  indicated  40,140  iterations  to  comprise  one  cycle  in  the  present  configuration.  The  center 
of  an  upstream  wake  passes  the  top  blade  nosetip  at  approximately  iteration  66,231,  as  indicated  in  Table  1. 


hcntkm 

(Tunc) 

Reynolds  Number  (Tbousands) 

16.5  33.0  66.0  99.0 

Reynolds  Number  (Thousands) 

132.0  264.0  660.0  99.0* 

40140 

100 

200 

300 

400 

500 

600 

700 

800 

42147 

101 

201 

301 

401 

501 

601 

701 

801 

44154 

102 

202 

302 

402 

502 

602 

702 

802 

46161 

103 

203 

303 

403 

503 

603 

703 

803 

48168 

104 

204 

304 

404 

504 

604 

704 

804 

50175 

105 

205 

305 

405 

505 

605 

705 

805 

52182 

106 

206 

306 

406 

506 

606 

706 

806 

54189 

107 

207 

307 

407 

507 

607 

707 

807 

56196 

108 

208 

308 

408 

508 

608 

708 

808 

58203 

109 

209 

309 

409 

509 

609 

709 

809 

60210 

110 

210 

310 

410 

510 

610 

710 

810 

62217 

111 

211 

311 

411 

511 

611 

711 

811 

64224 

112 

212 

312 

412 

512 

612 

712 

812 

66231 

113 

213 

313 

413 

513 

613 

713 

813 

68238 

114 

214 

314 

414 

514 

614 

714 

814 

70245 

115 

215 

315 

415 

515 

615 

715 

815 

72252 

116 

216 

316 

416 

516 

616 

716 

816 

74259 

117 

217 

317 

417 

517 

617 

717 

817 

76266 

118 

218 

318 

418 

518 

618 

718 

818 

78273 

119 

219 

319 

419 

519 

619 

719 

819 

80280 

120 

220 

320 

420 

520 

620 

720 

820 

*  cylinders  in  downward  motion. 


Table  1:  Identification  of  Output  Cases  with  Axial  Chord  Reynolds  Number  and 
Time  Interval  (Denoted  by  Iteration  Number). 
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Simplified  Analytical  Model 

As  an  overall  description  to  aid  in  the  interpretation  of  the  numerical  results,  it  is  convenient  to  consider  the 
following  idealized  model  of  steady  2-D  flow  across  a  turbine  stage.  The  model  is  derived  from  considerations 
presented  by  Hawthorne  [33]  and  private  communications  with  Dr.  Robert  Gray  [34]. 

Steady  flow  across  an  axisymmetric  stator/rotor  stage  may  be  represented  by  the  usual  equations  of  mass 
conservation,  momentum,  energy,  and  fluid  properties.  Here,  the  geometry  is  taken  to  have  the  possibility  of 
streamsurface  flaring,  as  represented  by  the  inlet  radial  thickness,  bj,  and  the  exit  radial  thickness,  b2.  The  vanes  and 
blades  may  either  be  stationary  or  in  rotation,  but  it  is  assumed  that  the  inlet  and  exit  flows  are  circumferentially  uniform 
in  an  averaged  sense.  It  is  assumed  that  radial  flow  components  are  negligible.  In  a  volume  averaged  sense,  in  a 
stationary  coordinate  frame,  the  basic  equations  may  be  listed  as  follows  in  the  commonly  used  notation  for  an  aribtrary 
segment  of  the  stage. 


PjViA^cospi  =  PjVjAjCOsPj  =  m, 

(1) 

m|vjSinPj  -  v,sinP,j  = 

= 

(2) 

(COsPj  -  v,cosp,]  =  p,A, 

-  P2A2  + 

(3) 

m 

S(r2  -  T.)  +  -  V,*) 

1 

•O' 

II 

(4) 

p  =  pRT. 

(5) 

Here,  the  cross-sectional  area.  A,  may  vary  with  streamsurface  height,  denoted  by  b.  Fy  and  represent  net  forces 
acting  on  the  fluid  in  vertical  and  horizontal  directions,  respectively,  as  exerted  by  vanes  and  blades  within  the  volume. 
The  angle,  p,  represents  the  local  flow  angle  relative  to  the  axial  direction.  It  is  assumed  that  hub  and  end-wall  effects 
are  not  included.  The  rotational  rate  of  the  turbine  blades  is  represented  by:  Q  =  .  Accordingly,  turbine  power 

extracted  from  the  flow  is  given  by:  W  =  tlRF^ ,  and  a  possible  heat  transfer  rate  into  the  flow  is  represented  by  Q . 

For  any  stage,  the  basic  equations  may  be  transformed  into  a  frame  moving  with  the  rotating  system  according 
to  the  usual  vector  decomposition,  as  follows: 


vcosP  =  ucosa  (6) 

vsina  =  usina  +  QR  (7) 

With  appropriate  subscripts,  these  equations  may  be  applied  at  either  the  inlet  or  the  exit.  Here,  u  and  a  represent  the 
flow  speed  and  relative  flow  angle  as  seen  in  the  rotating  frame. 
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Upon  squaring  these  expressions,  combining  at  each  station,  and  subtracting  the  totals  at  the  two  stations,  one 

obtains: 


uj*  -  Uj  =  V*  -  v/  -  2QR(ir,sinP,  -  v^sinP^), 


(8) 


which  may  be  combined  with  Eq.  (2)  to  yield  the  following: 


2  2  2 
U.  -  U,  =  V, 


2Q^. 

m 


Further  substitution  into  Eq.  (4)  results  in  the  energy  equation  as  viewed  in  the  rotor  frame: 


(9) 


(10) 


Here,  the  turbine  power  term  has  disappeared  since  the  blades  are  stationary  in  this  frame.  From  Eqs.  (1)  -  (5),  after 
transformation  to  the  rotor  frame,  one  may  obtain  the  explicit  relation: 

^  =  B  -  (b^  -  4ACr  (11) 

Uj  2A 


V  ~  1  cos  cc,  2  /  \ 

where:  A  =  YMt  cosset-  ""  Mj  \  B  =  +  Y^i  cosoc^fcoscCj  “  dj  ,  C  -  1  +  -  Mj  +  q. 

2  cos  ttj  ^ 

Here,  d  and  q  represent  dimensionless  drag  and  heating  applied  to  the  stage,  respectively.  The  dimensionless  lift  alters 

the  flow  angle  through  die  stage.  Therefore,  it  is  evident  that  the  mean  flow  across  the  turbine  stage  depends  on  blade 

geometry  (which  influences  flow  deflection),  drag  (d),  and  heating  (q),  in  addition  to  y  and  M,.  Other  flow  properties 

may  be  explicitly  obtained  from  the  above  representation.  In  particular,  the  pressure  loss  coefficient  is  expressed  as 

follows: 


It  is  noted  that  the  static  pressure  ratio  is  frame  independent,  whereas  the  Mach  numbers  depend  on  fimne  of  reference. 
Therefore,  the  pressure  loss  coefficient  expressed  in  a  stationary  fimne  varies  with  the  rotational  speed  of  the  rotor.  A 
numerical  case  is  presented  later,  as  it  applies  to  the  computational  research  considered  herein. 
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RESULTS 


Flow  Description 


Typical  velocity  flowfields  are  shown  in  Figs.  2a  and  2b  for  high  (660K)  and  low  (16.5K)  Reynolds  numbers, 
respectively,  as  computed  using  VBI2D  (cases  713  and  1 13  in  Table  1).  It  is  evident  that  the  suction  side  boundary  layer 
remains  largely  attached  for  the  high  Re  case,  while  it  separates  approximately  midway  along  the  axial  chord  for  the 
other  case.  A  comparison  of  the  various  runs  reveals  that  the  effective  flow  incidence  at  the  blade  nosetip  may  shift  by 
±20  deg.  during  the  passage  of  an  upstream  wake. 
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Figures  2a  and  2b:  Constant  Velocity  Contours  for  Cases  (a)  713  and  (b)  113. 
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(b) 
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Corresponding  skin  friction  distributions  are  shown  in  Figs.  3a  and  3b  (note  the  different  scales  used  for 
increased  clarity).  In  the  present  convention,  reverse  flow  on  the  suction  side  is  indicated  by  positive  values  of  skin 
friction,  while  streamwise  flow  is  indicated  by  negative  values.  These  indications  are  reversed  in  sign  on  the  pressure 
side.  Some  of  the  oscillations  occurring  on  the  suction  side  are  peiiiaps  attributable  to  the  influence  of  cylinder  induced 
vortices,  while  the  aft  flow  reversals  at  low  Re  (Case  1 13)  are  connected  with  vortices  originating  from  the  turbine  blade 
itself. 


Figures  3a  and  3b:  Turbine  Blade  Skin  Friction  for  Cases  (a)  713  and  (b)  113. 


The  locations  of  transition  to  turbulence,  as  set 
by  the  Baldwin-Lomax  model,  may  be  observed  in  Fig. 
4  for  Case  713,  in  which  the  turbulent  coefficient  of 
viscosity  is  plotted  at  one  grid  point  away  from  the 
surface  along  the  periphery  of  the  turbine  blade.  It  is 
somewhat  curious  that  the  model  is  triggered  at  two 
isolated  locations  near  the  leading  edge  of  the  blade 
and  briefly  along  the  aft  pressure  side.  The  low  Re 
cases  remain  totally  laminar,  whereas  for  Case  713,  the 
main  transition  occurs  at  two  locations  on  the  suction 
side  near  mid-chord. 


X 

Figure  4:  Turbulent  Viscosity  Near  Blade  Surface 
(Case  713).  Baldwin-Lomax  Model 
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Pressure  Coefficient 


Typical  pressure  coefficient  distributions  on  the  turbine  blade  surface  are  shown  in  Figs.  5a  -  5d.  Here,  the 
reference  pressure  is  taken  to  be  the  constant  static  pressure  imposed  upstream  of  the  cylinders  at  the  start  of  each  Re 
case.  As  each  run  progresses,  the  ambient  static  pressure  level  changes  with  time  as  power  is  either  added  or  removed 
through  the  rotation  of  the  cylinder  stage.  Presumably,  the  pressure  level  adjusts  itself  according  to  the  emanation  of 
acoustic  waves  through  upstream  and  downstream  boundaries.  The  choice  of  a  constant  reference  pressure  for  each  Re 
case  proves  useful  in  comparing  pressure  distributions  at  different  times,  although  the  Cp  distributions  shown  in  the 
figures  exhibit  uncharacteristically  high  or  low  levels  owing  to  the  shift  in  actual  ambient  pressure  with  time.  Pressure 
differentials,  both  vertical  and  axial,  decrease  significantly  at  lower  Re,  as  the  degree  of  effective  suction  is  reduced. 

Normally,  one  associates  with  separated  flow  an  increase  of  airfoil  drag.  However,  since  the  turbine  blade  tail  section 
is  highly  inclined  to  the  axial  direction,  a  loss  of  relative  suction  corresponds  to  a  decrease  in  both  axial  pressure  drag 
and  torque. 


Figures  5a  -  5d:  Cp  vs.  Axial  Distance  for  (a)  Case  713  and  (b)  Case  1 13. 
Cp  vs.  Circumferential  Distance  for  (c)  Case  713  and  (d)  Case  113. 
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Flow  Periodicity 


One  objective  of  the  present  study  has  been  to  observe  the  time  dependent  effects  of  cylinder/blade  interactions. 
For  this  purpose,  output  from  the  VBI2D  code  has  been  extracted  at  twenty  time  intervals  within  one  flow  cycle,  as 
indicated  in  Table  1,  after  the  unsteady  computations  appeared  to  have  reached  an  approximately  periodic  condition. 
This  periodicity  was  checked  for  a  low  Reynolds  number  case  (16.5K)  by  continuing  the  computations  for  an  additional 
period.  The  results  shown  in  Figs.  6a  and  6b  demonstrate  the  high  degree  of  similarity  for  two  neighboring  cycles.  It 
may  be  expected  that  such  histories  would  not  be  identical  owing  to  random  differences  of  vortex  shedding,  although 
further  computations  over  several  periods  have  not  yet  been  attempted. 


Figures  6a  and  6b:  Cylinder  and  Blade  Lift  Histories  (Re  =  16.5K)for  Two  Cycles 
Iterations:  (a)  40140-80280  (b)  80280-120420. 

Variations  in  lift  histories  at  two  Reynolds  numbers  are  shown  in  Figs.  7a  and  7b.  At  the  higher  Re  (660K), 
the  blade  lift  oscillates  at  a  frequency  that  is  approximately  the  same  as  that  of  the  cylinder,  whereas  in  Fig.  7b  (33K), 
the  blade  frequency  is  lower.  The  cylinder  shedding  frequency  evidently  decreases  with  decreasing  Re,  which  implies 
a  change  in  Strouhal  number,  since  the  flow  velocity  is  not  varied  here. 


Figures  7a  and  7b:  Lift  Histories  for  Cases  (a)  Re  =  660K  and  (b)  Re  —  33K. 
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Pressure  Profiles 


The  stagnation  pressure  loss  coefficient,  as  defined  by  Eq.  12,  is  computed  for  the  VBI2D  cases  by  performing 
a  numerical  quadrature  in  respective  vertical  cuts  in  order  to  find  averaged  values  for  the  pressures.  The  inlet  location 
was  chosen  to  be  the  first  column  of  the  blade  H-grid,  and  the  exit  location  was  the  last  column.  Sample  vertical 
distributions  of  the  pressures  are  shown  in  Figs.  8a  and  8b.  Here,  fluctuations  are  seen  both  upstream  and  downstream 
of  the  turbine  blade.  Presumably,  some  of  these  variations  may  be  due  to  influences  of  vortices  swept  downstream  from 
the  cylinders,  and  by  the  modification  of  total  pressure  in  the  unsteady  flow  around  the  blades.  In  particular,  it  is 
interesting  to  note  the  distribution  of  total  pressure  downstream  of  the  turbine  blade.  In  some  locations,  the  total 
pressure  rises  significantly  above  its  upstream  value,  whereas  in  other  regions  of  the  wake,  the  level  is  markedly  lower. 
The  reduction  of  total  pressure  by  viscous  losses  is  to  be  expected,  but  its  generation  may  be  somewhat  surprising. 


Figures  8a  and  8b:  Vertical  Pressure  Profiles  for  Cases  (a)  713  and  (b)  113. 


Alternating  regions  of  high  and  low  stagnation 
pressure  also  occur  in  the  wakes  of  the  cylinders.  In 
Fig.  9,  near  peak  locations  in  the  wake,  the  stagnation 
pressure  exceeds  its  level  attained  upstream  of  the 
cylinder.  While  such  a  circumstance  is  not  to  be 
expected  for  steady  flows  around  stationary  objects,  it 
apparently  can  occur  for  unsteady  flow  about  stationary 
objects,  and  certainly  for  flow  about  moving  objects 
which  impart  mechanical  energy  to  the  flow.  A 
preliminary  literature  search  by  the  author  has  not 
yielded  prior  mention  of  this  basic  phenomenon  in 
regard  to  wake  flows  past  stationary  cylinders. 


Figure  9:  Pj  Contours  for  Case  714. 
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Pressure  Loss  Coefficient 


Variations  in  total  pressure  loss  coefficient  within  one  cycle  are  shown  in  Figs.  10a  and  10b  for  two  cases. 
Large  oscillations  occur  within  the  cycle  for  the  low  Re  case,  so  that  Yp  is  even  negative  at  times.  It  should  be  noted 
that  flow  structures,  which  are  convected  streamwise  through  the  blade  passages,  would  involve  a  phase  lag  between 
exit  and  inlet  locations,  whereas  Yp  provides  an  instantaneous  comparison  between  these  locations.  The  higher  Re 
case  exhibits  no  negative  excursions. 


Figures  10a  and  10b:  Total  Pressure  Loss  Coefficient  Variation  Over  One  Cycle  for  Re  of  (a)  33K  and  (b)  660K. 


The  results  of  averaging  Yp  over  ten  equally  spaced 
time  intervals  during  each  cycle  are  shown  in  Fig.  1 1  as 
Re  is  varied.  These  results  are  somewhat  contrary  to 
earlier  expectations.  From  a  previous  study  [1 1],  it  was 
anticipated  that  Yp  would  have  small  values  at  higher 
Re;  that  there  would  be  a  marked  break  in  the  trend 
near  lOOK;  and  that  Yp  would  increase  dramatically  at 
lower  Re  owing  to  the  large  losses  associated  with 
suction  side  separation.  The  present  results  do  not 
support  this  view  in  that  the  calculated  Yp  remains 
fairly  constant  (range  of  0.2  —  0.3),  with  a  peak 
occurring  in  the  distribution  near  Re  =  132K. 


Reynolds  Number  (Thousands) 
Figure  11:  Yp  Averaged  Over  One  Cycle  vs  Re. 
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Comparison  with  Anafytical  Modei 


In  an  effort  to  reconcile  fliis  unexpected  result,  computations  have  been  applied  to  the  simple  analytical  model 
presented  ealrier.  A  plot  of  Yp  as  a  function  of  exit  angle  and  dimensionless  drag  is  presented  in  Fig.  12.  The  inlet  flow 
angle  is  taken  to  be  45  deg.  and  the  inlet  Mach  number  is  0.3.  As  expected,  the  pressure  loss  coefficient  increases  with 
increasing  drag,  but  it  is  of  interest  to  note  that  with  increasing  turbine  blade  camber  (resulting  in  a  larger  flow 
deflection),  Yp  decreases  and  may  even  become  negative.  Evidently,  within  this  model,  the  compressive  effect  on  the 
pressure  side  of  the  blade  can  overcome  losses  due  to  drag,  provided  that  the  flow  deflection  can  be  maintained  to  be 
large.  A  possible  explanation  for  die  earlier  VBI2D  results  may  then  be  as  follows.  At  higher  Reynolds  numbers,  with 
minimal  flow  separation,  the  exiting  flow  angle  is  relatively  large  (approximately  -65  deg.  from  Fig.  2a),  while  the 
tuibine  blade  drag  is  relatively  high  (refer  to  Fig.  5c ).  As  the  Reynolds  number  decreases,  the  suction  side  boundary 
layer  separates,  thereby  decreasing  turbine  blade  effectiveness,  and  reducing  the  flow  deflection  angle.  However,  owing 
to  the  relative  loss  of  suction,  both  lift  and  drag  are  reduced  on  the  blade.  According  to  Fig.  12,  it  is  possible  for  the 
pressure  loss  coefficient  to  remain  relatively  constant  in  this  process,  provided  the  drag  decreases  as  the  deflection  angle 
decreases.  It  had  been  anticipated  earlier  that  with  flow  separation,  the  flow  angle  would  decrease  but  that  the  drag 
would  increase,  resulting  in  a  large  increase  of  Yp.  This  explanation  is  to  be  regarded  as  being  provisional,  with  further 
investigation  recommended. 


Figure  12:  Total  Pressure  Loss  Coefficient  (Yp)  vs 
Drag  and  Flow  Deflection  (a2)  Using  Eq.  (12). 
(a,  =  45«;  M,  =0.3) 
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CONCLUSIONS 


The  unsteady  VBI2D  code  has  been  applied  to  a  sequence  of  seven  different  Re  flows  around  a  stationary 
cascade  of  Langston  turbine  blades  influenced  by  die  wakes  of  moving  upstream  cylinders.  Vortex  shedding  and  wake 
deficits  cause  <!ignifirant  shifts  (+/-  20  deg. )  in  the  effective  flow  incidence  at  the  blade  nosetip.  At  the  lower  Reynolds 
numbers  (16.5K-66K),  extensive  suction  side  separations  have  been  computed  and  quantified,  whereas  in  the  higher 
Re  range  (132K-660K),  flow  separation  is  non-existent  or  of  minimal  extent.  Computations  of  the  total  pressure  loss 
coefficient  are  presented  at  twenty  equal  time  steps  within  a  flow  interaction  cycle,  and  cycle  averaged  results  are 
presented  as  a  function  of  Re.  A  simplified  analytical  model  is  formulated  to  aid  in  the  interpretation  of  the  Yp  results. 
A  provisional  conclusion  is  that  while  flow  separation  causes  a  dramatic  loss  of  turbine  power,  the  competing  effects 
of  decreased  flow  deflection  and  decreased  drag  combine  so  that  Yp  is  not  altered  as  much  as  had  been  anticipated 
mitially.  The  identification  of  these  unsteady  phenomena,  along  with  unanticipated  swings  in  total  pressure,  provide 
substance  for  continued  investigation. 
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DEVELOPMENT  OF  A  PROBABILISTIC 
ASSESSMENT  FRAMEWORK  FOR  HIGH  CYCLE  FATIGUE 
FAILURES  OF  GAS  TURBINE  ENGINE  BLADES 


M.“H.  Herman  Sheii 
Associate  Professor 
Dept,  of  Aerospace  Engineering, 
Applied  Mechanics,  &;  Aviation 
Tlie  Ohio  State  University 


Abstract 

A  framework  for  the  probabilistic  analysis  of  high  cycle  fatigue  is  developed.  The 
framework  will  be  useful  to  U.S.  Air  Force  and  aeroengine  manufacturers  in  the  design  of 
high  cycle  fatigue  in  disk  or  compressor  components  fabricated  from  Ti-6A1-4V  under  a 
range  of  loading  conditions  that  might  be  encountered  during  service.  The  main  idea  of  the 
framework  is  to  characterize  vibratory  stresses  from  random  input  variables  (uncertainties 
such  as  initial  crack  size,  crack  location,  loading,  material  properties,  and  manufacturing 
variability).  The  characteristics  of  such  vibratory  stresses  will  be  portrayed  graphically  as 
histograms,  or  probability  density  function  (PDF).  The  outcome  of  the  probability  mea¬ 
sures  associated  with  all  the  values  of  a  random  variable  exceeding  the  material  capability 
is  achieved  by  a  failure  function  ,9(X)  defined  by  the  difference  between  the  vibratory  stress 
and  Goodman  line  or  surface  such  that  the  probability  of  HCF  failure  is  Pj  =  P[g{X.)  <  ()]. 
The  framework  can  be  used  to  facilitate  the  development  design  tools  for  the  prediction  of 
inspection  schedules  and  reliability  in  aeroengine  components.  Such  tools  could  lead  ulti¬ 
mately  to  improved  life  extension  schemes  in  aging  aircraft,  and  more  reliable  methods  for 
the  design  and  inspection  of  critical  components. 
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DEVELOPMENT  OF  A  PROBABILISTIC 
ASSESSMENT  FRAMEWORK  FOR  HIGH  CYCLE  FATIGUE 
FAILURES  OF  GAS  TURBINE  ENGINE  BLADES 


M.-H.  Herman  Shen 


Introduction 


The  gas  turbine  engines  operating  environment  of  nonuniform  flow  field  repeated  load¬ 
ing  creates  conditions  that  are  favorable  to  high  cycle  fatigue  (HCF)  degradation  of  metal¬ 
lic  rotating  components.  This  type  of  component  demands  durability,  high  reliability,  light 
weight,  and  high  performance.  Therefore,  lifetime  failure-free  design  criteria  based  on  Good¬ 
man  Diagram  has  been  adopted  by  the  aircraft  engine  design  community  for  ensuring  safety 
of  critical  structural  components. 

For  example,  a  traditional  HCF  turbine  blading  system  design  procedure  is  shown 
schematically  in  Figure  1.  This  design  process  usually  consists  of  a  structural  dynamics 
analysis  to  determine  natural  frequencies  and  mode  shapes  at  certain  operating  speed  ranges 
and  a  stress  analysis  to  calculate  dynamic  stress  distribution  for  identifying  maximum  vibra¬ 
tory  stress  location  or  area  under  a  series  of  given  excitations.  Once  the  maximum  stresses 
for  each  vibration  mode  are  determined,  high  cycle  fatigue  assessment  can  be  achieved  by 
measuring  the  margiti  between  the  maximum  vibratory  stress  and  the  material  fatigue  ca¬ 
pability  which  is  a  straight  line  (so  called  Goodman  line  in  this  study)  drawn  between  the 
mean  ultimate  strength  at  zero  vibratory  stress  and  mean  fatigue  strength  at  10^  cycles  (or 
infinite  life). 

Therefore,  the  traditional  HCF  design  procedure  is  usually  established  by  using  the 
results  of  a  simple  deterministic  stress  analysis  without  take  into  account  the  information 
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such  as  degradation  of  material  properties,  scatter  in  fatigue  life  testing,  and  uncertainties 
inherent  in  the  operating  conditions  in  the  real  world. 

In  turn,  as  it  has  been  reported,  a  number  of  structural  failures  have  occurred  on 
aircraft  engines  during  development  testing  and  operational  service.  These  engines  were 
optimally  designed  and  assumed  to  be  valid  during  their  service  life.  These  incidents  trig¬ 
gered  an  awareness  of  the  fact  that  current  aircraft  engine  critical  structural  components, 
blading  system  in  particular,  although  they  satisfy  the  lifetime  failure-free  design  criteria, 
sometimes  fail. 

Failure  Analysis  of  Gas  Turbine  Blades 

Gas  turbine  blade  failures  are  generally  resulting  from  high  vibratory  stress  due  to 
excessive  vibration  in  blading  during  forced  response.  Forced  response  usually  occurs,  under 
a  nonuniform  flow  field  operating  environment,  when  the  excitation  frequency  from  unsteady 
aerodynamic  load  is  coincident  with  any  blade’s  natural  frequency.  Hence,  an  adequate 
blading  design  will  be  the  case  if  all  the  resonant  conditions  could  be  avoided.  Realistically, 
it  is  impossible  due  to  the  fact  that  insufficient  real  world  loading  information  provided  at 
the  time  of  the  analysis  and  design. 

Since  the  effects  of  randomness  in  real-world  operating  environment  cannot  be  quan¬ 
tified  accurately  by  a  traditional  deterministic  design  approach  as  shown  in  Figure  1,  it 
is  beyond  the  capability  of  such  approach  to  provide  the  important  information  regarding 
risk  of  failure,  or  the  sensitivities  of  the  design  variables  to  risk.  Hence,  in  this  work,  a 
probabilistic  prediction  procedure  as  presented  in  Figure  2,  which  accounts  for  variability 
in  geometry  and  aerodynamic  loading  has  been  developed  to  calculate  the  important  design 
and  maintenance  information  of  sensitivity,  (high  cycle)  fatigue  reliability,  and  associated 
risk.  In  addition,  in  this  study,  the  reliability  due  to  the  variability  in  materials  (e.g.  dam¬ 
age,  cracks,  degradation,  etc.)  will  be  discussed. 
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Blade  Vibration  Model 


An  analytical  procedure  has  been  obtained  by  Shen  [1]  for  modeling  the  behavior  of 
rotating  blades  with  taking  into  account  the  effects,  so-called  ’’centrifugal-softening  and 
centrifugal-stiffing”,  of  the  displacement-dependent  centrifugal  forces.  The  procedure  pro¬ 
vides  close  form  equations  of  motion  for  a  rotating  flat  uniform  blade  in  flapping  motion 
and  lead-log  motion  as 


EIz{w””  +  0x)  +  mib  +  ==  0,  (1) 

=  (2) 

4-  /?'")  —  mv  +  —  x^)v”  =  0,  (3) 

j  y  2 

£;4(u'" +  /?")-  =  (4) 


Here,  u,  w  are  the  displacements,  Py  are  the  shears  angles,  k  is  the  shear  correction 
factor,  and  E,  G  are  the  effective  blade  bending  and  shear  moduli,  respectively. 

For  high  cycle  fatigue  applications,  in  accordance  with  the  above  analytical  procedure, 
a  3-D  finite  element  model  (Figure  3)  was  developed  [2]  through  classical  variational  prin¬ 
ciples.  The  finite  element  model  can  be  used  in  developing  the  HCF  reliability  model. 

HCF  Reliability  Model 

HCF  is  a  result  of  repeated  load,  but  well  below  the  yield  strength  of  the  material 
cycling,  and  the  consequential  damage,  places  restrictions  on  the  usable  lifetime  of  gas 
turbine  engines  experience  these  loads.  The  cycling  can  be  mechanical  (e.g.  vibration), 
aeroelastic  (e.g.  flutter,  resonance)  or  environmental  (e.g.  thermal),  and  in  its  early  stages 
the  accumulating  damage  is  often  very  difficult  to  detect.  The  mechanical  or  aeroelastic 


29-  5 


cycling  due  to  variability  in  geometry,  aerodynamics,  and  materials  cannot  be  predicted 
with  certainty,  and  must  be  considered  a  random  process.  The  effects  of  this  randomness 
cannot  be  quantified  accurately  by  a  deterministic  analysis,  and  loading  must  therefore  be 
treated  as  a  random  variable. 

During  the  design  phase  or  service  life,  these  uncertainties  or  raiulom  variables  can  be 
combined  with  other  operating  experience  from  tests  and  flights  to  calculate  reliability  and 
risk  using  a  probabilistic  analysis  procedure.  The  main  idea  of  the  probabilistic  procedure 
is  to  characterize  vibratory  stresses  from  random  input  variables  (uncertainties  such  as  ini¬ 
tial  crack  size,  crack  location,  loading,  material  properties,  and  manufacturing  variability). 
The  characteristics  of  such  vibratory  stresses  will  be  portrayed  graphically  as  histograms, 
or  probability  density  function  (PDF). 

Failure  State  Function 

The  outcome  of  the  probability  measures  associated  with  all  the  values  of  a  random 
variable  below  a  target  value  is  a  cumulate  distribution  function  (CDF).  For  example,  in 
structural  reliability  analysis,  a  PDF  or  a  failure  state  function  or  limit  state  function 
^(X)  needs  to  be  defined  by  the  difference  between  the  vibratory  stress  (cr)  and  Goodman 
line  or  surface  {(p)  in  terms  of  a  vector  of  basic  random  variables  (or  uncertainties),  X  = 

g{X)=4>-cj{X.)  (5) 


such  that  the  CDF  or  the  probability  of  failure  is 


Pf  =  P[g{X)  <  0] 


(C) 


Example:  Structural  Reliability  of  Turbine  Blades 
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This  example  illustrates  a  general  procedure  for  predicting  gas  turbine  blade’s  struc¬ 
tural  reliability  and  associate  risk  when  high  vibratory  stress  fluctuation  develop  due  to 
uncertainties  such  as  manufacturing  variability  and  other  stress  raisers  that  arise  when  the 
excitation  frequency  from  unsteady  aerodynamic  load  is  coincident  with  any  blade’s  natural 
frequency. 

Suppose  that  a  high  vibratory  stress  fluctuation  is  developed  in  a  rotating  gas  turbine 
blade  and  progressively  increases  along  a  particular  stress  ratio  R.  The  first  step  in  avoiding 
the  catastrophic  failure  of  the  entire  blading  system  is  estimating  the  reliability  or  the 
probability  of  failure  of  the  blade  when  high  vibratory  stress  appears. 

As  it  has  been  discussed  above,  the  reliability  estimation  procedure  consists  of  deter¬ 
mining  the  probability  density  function  (PDF)  and  calculating  the  cumulative  probability 
of  failure  (CDF)  for  crack  propagation  through  FPI  process. 

If  we  choose  the  vibratory  stress  a  as  the  probability  measure,  the  performance  function 
of  FPI  can  be  written  as 

giX)  =  <t>-aiX.)  (7) 

where  is  the  Goodman  line  or  Goodman  surface.  In  this  study,  the  vibratory  stress  fluctu¬ 
ation  is  assumed  to  be  controlled  by  the  uncertainties  that  associate  with  the  manufacturing 
variability  and  characterized  by  structural  damping  variation.  The  vibratory  stress  at  the 
various  high  frequency  modes  is  calculated  using  the  3-D  finite  element  model  [2]. 

An  example  of  a  gas  turbine  blade  that  was  tested  is  mad(;  of  Ti-6A1-4V  having  di¬ 
mensions:  3.0  inch  wide  x  7.0  inch  long.  The  Haigh  or  Goodman  diagram  of  the  Ti-6A1-4V 
as  shown  in  Figure  4  is  obtained  from  the  study  by  Maxwell  and  Nicholas  [3].  These  data 
will  be  used  to  form  the  failure  criteria  or  failure  boundary  (j)  in  the  failure  state  function  g 
for  the  prediction  of  HCF  reliability  and  risk. 
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A  finite  element  analysis  has  been  conducted  for  the  damped  free  and  force  vibrations 
analysis  of  the  non-rotating  and  rotating  blades  using  the  blade  model  as  shown  in  Figure 
3.  In  accordance  with  the  testing  data  available  in  the  public  domain  [4],  highest  vibratory 
stresses  are  usually  occurred  on  the  high  frequency  stripe  vibration  modes.  Hence,  in  this 
study,  the  proposed  reliability  prediction  procedure  is  demonstrated  on  the  third  stripe 
modes  only. 

As  shown  in  Figure  5,  the  third  (11925.1  Hz)  free  stripe  vribation  mode  (3s)  of  the  blade 
are  calculated  with  damping  loss  factor  (^)  0.0077  (inverse  of  the  damping  capability).  Next, 
let’s  consider  the  cases  of  the  blade  under  a  concentrated  external  harmonic  excitation  force 
and  a  static  load  applying  at  center  portion  of  the  blade.  The  magnitudes  of  the  excitation 
force  and  the  static  load  are  allowed  to  be  adjusted  for  achieving  appropriate  stress  ratios 
and  vibratory  stress  ranges. 

The  3rd  (11925  Hz)  stripe  mode  forced  response  was  computed  for  various  loss  factors 
^  obtained  from  the  normally  distributed  testing  data  set  with  0.0077  mean  value  and 
the  0.0025  standard  deviation.  The  mode  shape  and  the  corresponding  principle  stress 
distribution  are  shown  in  Figures  6  and  7,  One  should  note  that  the  mode  shape  is  in  close 
agreement  with  the  free  vibration  mode  shape  in  Figure  5,  and  tlie  maximum  von  Mises 
vibratory  stress,  as  expected,  occurs  near  the  trailing  edge  of  the  blade. 

Figure  8  shows  the  von  Mises  vibratory  stress  distributions  cr(0  calculated  through 
finite  element  analysis  under  various  values  of  damping  at  stress  ratio  (R)  equation  to  - 
1.  The  distributions  are  presented  in  the  form  of  probabilistic  density  function  (PDF), 
the  probability  of  stress  occurrence,  for  reliability  prediction.  With  the  vibratory  stress 
distributions  cr(0  the  probability  of  failure,  p/,  reliability  prediction  has  been  accomplished 
via  the  formulation 

oo 

Pf  =  j  (T{0<l>d.(Tv  (8) 

—  OO 
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and  qualitatively  represented  by  the  interference  between  the  PDF  curve  and  the  Goodman 
surface  (due  to  the  scatter  in  testing  data).  In  other  words,  the  larger  the  overlapped  area 
indicates  the  higher  the  failure  probability. 

However,  in  general,  such  formulation  is  very  difficult  to  evaluate  not  only  because  of 
the  multiple  integration,  but  the  joint  probability  distribution  may  not  even  be  available. 
Therefore,  a  numerical  method  (FPI)  [5]  is  employed  in  this  study  to  calculate  an  approxi¬ 
mation  of  the  probability  of  failure  p y  distribution  (a  cumulative  distribution  function  CDF) 
as  shown  in  Figure  9. 

Probabilistic  Goddman  Diagram 

With  the  vibratory  stress  distributions  <t(^)  calculated  through  finite  element  analysis 
for  various  values  of  stress  ratio  R  and  alternating  stress,  the  probability  of  failure,  p/  is 
qualitatively  represented  by  the  intersection  of  the  vibratory  stress  distribution  and  the 
Goodman  line  as  shown  in  the  3-dimensional  plot  on  Figure  10.  In  turn,  a  new  reliabil¬ 
ity  design  diagram,  which  is  named  the  probabilistic  Goodman  Diagram,  is  constructed  by 
plotting  the  probability  of  failure,  p/  locus  on  the  original  Goodman  diagram.  Furthermore, 
since  the  reliability  of  the  blade  can  be  defined  as  the  probability  of  non-failure,  and  is  given 
by  Rs  =  I  —  Pf,  the  probabilistic  Goodman  Diagram  is  then  extended  to  include  the  Rg 
locus  as  shown  in  Figure  10. 

Conclusion 

A  probability-bjused  risk  assessment  framework  has  been  developed  to  predict  reliabil¬ 
ity  of  gas  turbine  engine  blades  subjected  to  high  cycle  fatigue.  The  procedures  provides  a 
systemic  approach  for  predicting  and  designing  turbomachinery  blading  reliability  against 
various  potential  fatigue  problems  for  all  relevant  vibratory  modes  and  taking  into  account 
variability  in  geometry  (e.g.  dimensional  variation,  mistuning,  etc.),  unsteady  aerodynam¬ 
ics,  structural  damping,  and  thermal  loading.  The  variability  in  materials  (e.g.  damage, 
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cracks,  degradation,  etc.)  can  be  also  considered  in  this  approach.  A  reliability  prediction 
was  performed  on  gas  turbine  blades  at  high  frequency  modes  (e.g.  third  stripe  mode) 
using  a  probabilistic  vibratory  stress  distribution  in  conjunction  with  the  Haigh  or  modified 
Goodman  Diagrams.  The  cumulative  reliability  {Rs)  and  probability  of  failures  {p/))  are 
then  calculated  using  the  fast  probability  integration  (FPI)  technique  to  construct  a  novel 
probabilistic  Goodman  (or  Haigh)  diagram  which  provides  blade’s  lifetime  design  guide  line 
and  an  optimal  maintenance  strategy  in  management  in  decision-making  relating  to  the 
PM/inspection  scheduling,  replacement,  spare  parts  requirements. 

References 

[1] .  M.-H.  H.  Shen,  “Passive  Vibration  Suppression  of  Rotating  Blades  Using  Piezoelectric 
Devices”,  under  review,  Journal  of  Smart  Materials  and  Structures. 

[2] ,  M.-H.  H.  Shen,  “Reliability  High  Cycle  Fatigue  Design  of  Gas  Turbine  Blades  Using 
the  Probabilistic  Goodman  Diagram”,  under  review.  International  Journal  of  Fatigue. 

[3] .  D.C.  Maxwell  and  T.  Nicholas,  “A  Rapid  Method  for  Generation  of  a  Haigh  Diagram 
for  High  Cycle  Fatigue”,  Submitted  to  ASTM  for  review,  1997. 

[4] .  A.J.A.  Mom  and  M.D.  Raizenne,  “AGARD  Engine  Disc  Cooperative  Test  Program”, 
AGARD-R-766,  1988. 

[5] .  M.-H.  H.  Shen  and  M.  S.  Shen,  1994,  “Probability  of  Failure  and  Risk  Assessment  of 
Structures  with  Fatigue  Cracks”,  AIAA  Journal,  Vol.  32,  No.  12,  pp,  2447-2455. 

Acknowledgment 

The  author  is  grateful  to  Dr.  Ted  Nicholas  for  his  valuable  discussions  and  expert 
suggestions. 


29-10 


Defects  &  Damages 
.  FOD 
.  Cracks 


29-  11 


Figure  1.  Traditional  HCF  Design  Process 
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Figure  2.  Proposed  Reliability  HCF  Design  Process 
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Figure  3;  The  Finite  Element  Mesh  of  OSU-ShenYen-Blade#2 
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Figure  4:  Hiagh  (or  Goodman)  Diagram  for  Ti-6A1-4V 
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Figure  7.  Vibratory  Stress  (von  Mises)  at  Srd  Stripe  Mode 
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Figure  8.  Vibratory  Stress  and  Material  Variability  Distributions  (R=-l) 
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Figure  9.  Cumulative  Probatylity  that  Vibratory  Stress  >  Material  Capability. 
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Figure  10:  Proposed  3-dimensional  probabilistic  Goodman  diagram 
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A  STUDY  OF  MODELS  AND  TOOLS  FOR 
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Abstract 

A  recent  trend  in  dataflow  architecture  design  is  incorporation  of  dataflow  concept  with  multithreading 
concept.  The  VGI  parallel  computer  is  a  multithreaded  static  dataflow  computer  intended  for  processing 
stream  data  present  in  video,  graphics,  and  image  processing  applications.  The  massively  parallel  VGI 
computer  consists  of  a  large  number  of  processors  connected  through  reconfigurable  communication  networks. 
Two  connected  processors  use  a  simple  handshake  protocol  for  communication  and  synchronization. 

This  report  focuses  on  the  issues  of  programming  models,  tools,  and  compilation  technology  for  program¬ 
ming  the  VGI  parallel  computer.  The  issues  are  discussed  at  three  programming  levels:  machine  level,  signal 
flow  graph  level,  and  SISAL  language  level.  Future  research  work  is  also  discussed  in  the  report. 
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A  STUDY  OF  MODELS  AND  TOOLS  FOR 
PROGRAMMING  THE  VGI  PARALLEL  COMPUTER 


Hongchi  Shi 


Introduction 

The  dataflow  approach  to  parallel  computing  has  been  attractive  to  many  researchers  due  to  its  simplicity 
and  elegance  in  specifying  parallelism  and  data  dependencies  [2,  8,  7,  10,  6].  The  pure  dataflow  model 
allows  parallel  and  pipelined  execution  at  the  level  of  individual  instructions.  Pure  dataflow  machines 
are  theoretically  able  to  exploit  all  the  available  program  parallelism.  However,  scheduling  each  instruction 
separately  and  synchronizing  on  each  data  value  lead  to  a  large  amount  of  overhead  [10, 1,11].  A  recent  trend 
in  the  design  of  dataflow  machines  is  incorporation  of  von  Neumann  architectural  features  and  multithreading 
concept  into  dataflow  architectures  to  reduce  the  overhead  incurred  from  synchronization  and  instruction 
scheduling  at  the  instruction  level  [4,  7,  10,  6].  In  such  hybrid  dataflow  machines  that  integrate  dataflow 
and  control  flow  models,  the  basic  scheduling  and  synchronization  unit  is  a  thread  which  is  a  sequence  of 
instructions. 

Following  the  ideas  of  dataflow  and  multithreading,  Srini  and  Rabaey  have  developed  a  multiprocessor 
stream  data  processing  computer  [14, 15, 16, 13].  The  parallel  stream  data  processing  computer  is  named  VGI 
and  designed  for  video,  graphics,  and  image  processing  applications.  A  VGI  computer  may  contain  several 
VGI  chips  connected  using  the  I/O  processors  in  the  chips.  Each  chip  contains  16  clusters  of  processors.  Each 
cluster  has  four  computational  processors,  one  memory  processor,  and  one  I/O  processor.  The  processors 
are  interconnected  through  two  levels  of  segmented  buses.  A  handshake  protocol  is  used  for  synchronization 
between  two  communicating  processors.  Each  computational  processor  has  its  own  instruction  memory 
for  instructions  processing  streams  of  data  present  in  video,  graphics,  and  image  processing  applications. 
Each  memory  processor  can  be  statically  configured  as  a  delay  line,  FIFO,  lookup  table,  or  RAM.  The  I/O 
processors  are  designed  for  external  communication.  From  a  more  abstract  perspective,  a  VGI  computer 
contains  a  set  of  interconnected  processors  that  each  repeatedly  execute  a  thread.  The  synchronization 
between  two  processors  is  implemented  such  that  it  happens  only  when  the  source  processor  is  ready  to 
send  and  the  destination  processor  is  ready  to  receive.  The  handshake  technique  for  synchronization  is 
much  simpler  than  the  traditional  token  matching  technique  and  reduces  the  overhead  significantly.  Once 
the  VGI  computer  is  configured,  the  connections  among  processors  and  the  threads  on  the  processors  are 
fixed  until  the  computer  is  reconfigured.  Threads  are  statically  scheduled  to  processors  during  configuration 
for  execution.  Each  thread  usually  receives  inputs  from  its  predecessors;  performs  some  operations  on  the 
inputs;  and  outputs  the  results  to  its  successors.  The  predecessors  and  successors  are  statically  determined 
by  the  connections. 

The  eventual  success  of  the  VGI  computer  will  depend  on  its  programmability.  In  general,  writing  parallel 
programs  is  currently  much  more  difficult  than  writing  sequential  programs  [1].  The  main  difficulties  are  how 
to  determine  which  tasks  execute  and  when,  how  tasks  communicate  and  synchronize,  and  how  to  configure 
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the  computer  to  accomplish  the  work  efficiently.  Despite  the  VGI  architecture  supports  well  parallelism  at 
thread  level  and  provides  simple  synchronization,  it  is  still  a  challenge  to  efficiently  map  algorithms  onto  the 
underlying  VGI  computer.  The  users  have  to  worry  about  how  to  assign  tasks  to  each  processor  and  how 
to  connect  processors  through  switch  settings.  Writing  a  parallel  program  by  explicitly  specifying  how  to 
handle  these  issues  is  extremely  complicated.  To  relieve  the  programmers  from  the  demanding  undertaking, 
abstract  programming  models  and  tools  have  to  be  developed  for  parallel  programming. 

This  report  focuses  on  the  issues  of  programming  models,  tools,  and  compilation  technology  for  program¬ 
ming  the  VGI  parallel  computer.  We  organize  the  rest  of  the  report  as  follows.  In  the  next  section,  we  give 
an  overview  of  the  VGI  parallel  computer.  In  the  following  sections,  we  discuss  programming  models  and 
tools  for  the  VGI  computer.  We  also  use  a  simple  example  to  illustrate  programming  at  different  levels.  We 
conclude  the  report  with  brief 

conclusions  and  future  work  in  the  last  section. 

VGI  Parallel  Computer 

The  VGI  computer  is  based  on  the  concept  of  processing  data  streams  as  they  are  generated  from  sensors 
for  applications  involved  with  video,  graphics,  and  image  processing.  It  consists  of  VGI  chips,  bus  interface 
chips,  and  SRAM  chips  on  a  single  board  or  in  a  multiple  board  box  and  usually  works  as  a  co-processor 
controlled  by  a  host  such  as  a  workstation,  PC,  or  PDA.  The  VGI  chips  can  be  connected  in  many  different 
ways  through  the  I/O  processors  in  each  chip  and  interconnection  networks. 

The  core  of  the  VGI  computer  is  the  stream  data  processing  chips  referred  to  as  VGI  chips.  The  VGI 
chip  is  a  multiple  instruction  multiple  data  stream  (MIMD)  parallel  processing  chip  and  operates  at  66  MHz 
clock  rate.  It  is  composed  of  16  clusters,  each  consisting  of  four  VGI  processors,  one  memory  processor, 
and  one  I/O  processor  which  are  connected  through  a  local  communication  network  referred  to  as  level- 
1  network.  The  level-1  networks  are  connected  through  the  global  level-2  network.  Thus,  the  VGI  chip 
has  64  VGI  processors,  16  memory  processors  for  supporting  on-chip  data  memory,  16  input/output  (I/O) 
processors  for  communication  with  external  memory  and  other  VGI  chips,  and  a  crossbar-like  interconnection 
network  which  interfaces  between  the  VGI  processors,  memory  processors,  and  I/O  processors,  as  shown  in 
Figure  1.  The  chip  can  be  rapidly  configured  through  a  scan  chain  that  reaches  all  the  processors.  The 
scan  chain  is  used  to  initialize  the  local  instruction  memory  in  each  VGI  processor,  the  data  memory  in 
each  memory  processor,  the  I/O  connections  in  each  I/O  processor,  and  the  switches  in  the  communication 
networks  which,  when  configured  using  the  scan  chain,  allow  both  point-to-point  and  simple  broadcast  types 
of  communication. 

VGI  Processor 

The  VGI  processor  is  simple  in  architecture  as  shown  in  Figure  2.  There  is  a  small  instruction  memory  that 
can  store  16  single-word  instructions.  Thus,  a  VGI  processor  program  can  be  at  most  16  instructions  long. 
The  VGI  processor  has  a  controller,  a  16-bit  data  path,  and  an  ALU.  It  supports  arithmetic  and  logical 
operations  (including  multiplication),  branching,  and  byte  manipulation. 
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Figure  1:  The  major  components  of  the  VGI  chip 


There  are  6  programmable  registers  in  each  VGI  processor  which  can  also  be  configured  as  3  queues  as 
shown  in  Figure  3.  The  six  registers  are  divided  into  three  groups  of  two  registers  each.  Each  group  of 
two  registers  can  be  individually  configured  as  either  two  random  access  registers  or  a  two-element  first-in 
first-out  queue.  Queues  are  used  for  communication  with  other  processors.  As  queues  they  obtain  inputs 
through  the  communication  networks.  As  registers  they  can  be  initialized  through  the  scan  chain  during 
configuration. 

Memory  Processor 

It  is  known  that  off-chip  communication  is  expensive  in  terms  of  power,  time  delay,  and  chip  area.  With 
on-chip  memory,  the  memory  bandwidth  can  be  increased  and  off-chip  traffic  can  be  reduced.  Each  cluster 
of  the  VGI  chip  has  one  memory  processor  which  contains  a  256  x  16  memory.  Like  the  VGI  processors, 
the  memory  processor  is  connected  to  the  level-1  network  and  has  a  three-stage  pipeline  comprising  fetch, 
memory  access,  and  communicate.  The  fetch  and  communicate  stages  are  used  to  synchronize  the  memory 
processor  with  the  VGI  processors.  The  architecture  of  the  memory  processor  is  as  shown  in  Figure  4. 

The  memory  processor  is  the  interface  between  other  processors  and  the  on-chip  memory.  It  is  designed 
to  implement  four  types  of  commonly  used  memory  accesses  in  signal  processing:  RAM,  lookup  table,  FIFO, 
and  delay  line.  When  configured  as  a  RAM,  the  memory  processor  can  execute  any  read  or  write  operation. 
A  read  or  write  request  is  sent  through  the  corresponding  handshake  line.  The  lookup  table  mode  is  similar 
to  the  RAM  mode  except  that  write  is  not  allowed  during  execution  and  the  table  in  the  memory  array 
is  loaded  through  the  scan  chain  during  configuration.  In  the  FIFO  mode,  addresses  for  read  and  write 
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come  from  the  read  and  write  pointers.  The  write  data  is  from  the  communication  networks.  Read  requests 
are  communicated  using  read  handshake  lines,  and  write  requests  are  communicated  using  write  handshake 
lines.  The  delay  line  mode  is  similar  to  the  FIFO  mode  except  that  read  is  executed  automatically  once  the 
delay  line  is  full. 

I/O  Processor 

The  I/O  processor  handles  communication  with  off-chip  devices  such  as  external  memory  and  other  VGI 
chips.  To  accommodate  commercial  SRAM  chips  and  other  VGI  chips,  the  I/O  processor  uses  two  cycles  for 
data  and  control  communication.  Figure  5  shows  the  block  diagram  of  the  I/O  processor. 

The  data  buses  in  the  I/O  processor  Ccui  be  connected  to  the  level-1  network  and  the  I/O  pad.  The  I/O 
processor,  configured  through  the  scan  chain,  either  takes  data  from  an  on-chip  VGI  processor  and  outputs 
to  a  pad,  or  the  other  way  around.  If  the  I/O  processor  is  supposed  to  take  data  from  the  pad,  it  outputs 
the  data  over  the  output  control  bus  and  one  of  the  last  two  output  data  buses.  If  the  I/O  processor  is  set 
up  to  output  data  to  the  pad,  then  it  takes  the  data  from  the  input  control  and  data  buses. 

Communication  Networks 

The  VGI  processor  has  three  data  output  ports  connected  to  the  three  output  buses  and  three  data  input 
ports  connected  to  the  three  queues.  It  also  has  two  input  ports  and  one  output  port  for  control  data  used  for 
communicating  conditions  such  as  carries  for  pipelining  some  complicated  operations  such  as  multiplication. 
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Figure  4:  The  memory  processor 
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Figure  5:  The  I/O  processor 


The  memory  processor  also  has  three  data  input  ports  and  three  data  output  ports,  but  does  not  have 
control  input/output  ports.  The  I/O  processor  has  three  data  input  ports  and  three  data  output  ports.  It 
also  has  two  control  input  ports  and  one  control  output  port.  All  the  ports  are  associated  with  corresponding 
handshake  lines  for  synchronization.  The  processors  can  be  abstracted  as  shown  in  Figure  6  for  the  purpose 
of  connecting  processors  to  implement  an  algorithm. 
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Figure  6:  A  processor  with  input/output  ports 

Each  of  the  VGI  and  I/O  processors  in  a  cluster  has  17  programmable  switches  for  connecting  data  and 
control  ports  to  the  level-1  network.  The  memory  processors  have  fewer  switches  since  they  do  not  have 
control  ports.  There  are  6  data  buses  and  4  control  buses  in  the  level-1  network.  There  are  also  3  internal 
bypass  buses  connecting  neighboring  processors.  The  switches  are  numbered  in  Figure  7. 
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Figure  7:  The  level-1  switches  in  each  cluster 


For  processors  in  one  cluster  to  communicate  with  processors  in  another  cluster,  the  level-2  communication 
network  is  provided.  The  level-2  network  consists  of  16  16-bit  data  buses  and  8  2-bit  control  buses  with 
their  associated  handshake  lines.  The  level-1  and  level-2  buses  form  a  crossbar  with  programmable  switches 
at  the  crossings  of  the  level-1  and  level-2  buses.  The  switches  allow  connections  of  the  level- 1  and  level-2 
buses.  The  50  switches  for  each  cluster  are  set  through  the  scan  chain.  Figure  8  shows  the  switches  for  the 
connection  of  an  upper  cluster  and  a  lower  cluster  to  the  level-2  buses. 
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Figure  8:  The  level-2  switches  for  each  upper  and  lower  cluster  pair 


Machine  Level  Programming 

The  lowest  level  programming  is  at  the  machine  level.  At  this  level,  the  programmer  is  aware  of  all  the 
physical  processors  and  the  switches  that  can  be  set  to  form  paths  for  connecting  the  processors. 

Programming  Model 

The  machine  level  programming  model  abstracts  the  VGI  computer  for  the  purpose  of  programming  the 
computer. 
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Each  VGI  processor  can  implement  some  function  that  transforms  every  set  of  data  items  from  its 
predecessors  into  a  set  of  data  items  for  its  successors.  The  function  is  fixed  for  every  set  of  inputs  and  is 
usually  repeated  in  a  loop.  The  function  should  be  simple  enough  to  be  implemented  in  at  most  16  VGI 
instructions.  Unlike  the  VGI  processors,  the  memory  and  I/O  processors  work  in  some  mode  among  a  few 
predefined  modes.  The  specific  mode  is  chosen  during  configuration  and  fixed  during  execution. 

A  processor  can  send  data  items  to  other  processors  through  its  three  output  ports  and  can  receive  data 
items  from  other  processors  through  its  three  input  ports.  An  output  port  of  a  processor  can  be  connected 
to  an  input  port  of  another  processor  by  setting  switches  to  form  a  path  between  the  output  port  and  the 
input  port.  The  connection,  after  configured,  is  fixed  during  execution.  Two  communicating  processors  have 
to  wait  until  the  data  is  transferred  from  the  sender  to  the  receiver.  The  transfer  only  takes  one  cycle  when 
both  the  sender  and  the  receiver  are  ready.  There  are  also  two  input  ports  and  one  output  port  in  the  VGI 
and  I/O  processors  for  communicating  control  data  such  as  carries. 

To  implement  a  solution  to  a  stream  data  processing  problem,  the  programmer  has  to  break  up  the 
solution  into  a  sequence  of  tasks,  each  transforming  some  input  streams  into  some  output  streams.  Each 
task  is  assigned  to  a  physical  processors.  If  an  output  stream  of  task  A  assigned  to  processor  A  is  an  input 
stream  of  task  B  assigned  to  processor  B,  there  should  be  a  connection  between  the  output  port  of  processor 
A  for  the  output  stream  of  task  A  and  the  input  port  of  processor  B  for  the  input  stream  of  task  B.  The 
tasks  are  then  implemented  using  the  VGI  instructions.  The  switch  settings  for  processor  connections  and 
the  assignment  of  tasks  to  processors  are  stored  in  a  file  as  the  VGI  program  for  the  stream  data  processing 
problem. 

Three  tedious  processes  are  involved  in  the  above  programming  paradigm  after  the  programmer  divides 
the  algorithm  into  tasks  and  figures  out  the  relationships  among  tasks  in  terms  of  inputs  and  outputs.  The 
first  process  is  to  assign  the  tasks  to  the  processors.  The  second  process  is  to  determine  which  data  stream 
goes  through  which  port.  The  third  process  is  to  find  switches  that  form  a  path  to  connect  an  output  port 
of  a  processor  to  an  input  port  of  another  processor.  Since  assignment  of  processors  and  input/output  ports 
affects  the  availability  of  paths  between  communicating  processors,  the  above  processes  may  have  to  be 
repeated  with  different  processor  and  input/output  port  assignments  to  find  a  valid  assignment.  These  three 
processes  affect  each  other,  making  the  machine  level  programming  very  difficult.  There  is  a  need  for  tools 
to  keep  track  of  resources  and  assist  the  routing  process.  Figure  9  shows  the  tools  and  information  flow  in 
machine  level  programming. 

Tools 

Since  processors  in  each  cluster  have  more  connections  among  themselves,  a  good  heuristic  for  processor 
assignment  is  to  assign  tasks  with  more  connections  among  themselves  to  the  same  cluster.  Finding  paths 
can  be  assisted  with  a  routing  tool.  The  routing  tool  searches  the  currently  available  switches  to  form  a  path 
between  an  output  port  of  a  processor  and  an  input  port  of  another  processor.  It  shows  the  possible  paths  for 
the  programmer  to  choose.  Since  the  paths  with  switch  numbers  are  hard  to  imagine,  it  is  better  to  display 
all  the  switches  in  the  level-1  and  level-2  networks  and  mark  the  paths  already  taken  for  the  programmer. 
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Figure  9:  Machine  level  programming  tools  and  information  flow 

This  will  also  allow  the  programmer  to  interactively  select  switches  to  form  paths. 

After  the  processors,  input/output  ports,  paths  are  determined,  the  programmer  then  writes  the  assembly 
code  for  each  processor.  The  code  for  each  processor  can  be  assembled  using  the  VGI  assembler. 

The  switch  settings  for  the  networks  and  the  assembled  code  for  each  processor  are  put  together  to 
generate  a  scan  chain  bit  stream  for  configuration  of  the  VGI  computer. 

A  VGI  simulator  has  been  under  development  for  VGI  program  execution  and  verification.  The  func¬ 
tional  simulator  is  designed  to  give  fast  feedback  to  the  programmer  in  the  trial-and-error  machine  level 
programming. 

A  Programming  Example 

To  illustrate  further  the  machine  level  programming  paradigm,  we  describe  in  details  the  development  of  a  ID 
convolution  on  the  VGI  computer.  The  ID  convolver  computes  the  output  signal  stream,  bi,  i  =  0, 1, 2,  •  •  •, 
from  the  input  signal  stream,  ai,i  =  0, 1, 2, •  •  •,  by 

—  fc — iOii — 1  “I”  Ajoflj  A..^inx+i. 

We  can  divide  the  computation  of  bi  into  four  tasks.  Task  1  computes  ;  task  2  computes  koai',  task 

3  computes  fc+iOj+i;  and  task  4  accumulates  the  results  from  tasks  1,  2,  and  3.  The  input  stream  to  task  1  is 
00,01,02,  •  •  •  and  the  output  stream  is  0,  fc_iOo,  fc-iOi,  fc-i02,  •  •  -.  The  input  stream  to  task  2  is  00,01,02,  •  •  • 
and  the  output  stream  is  A:oOo,fcoOi,A:o02,----  The  input  stream  to  task  3  is  oo,0i,02,-  -  and  the  output 
stream  is  fc+iOi,  ifc+102,  ■  •  •.  Let  task  3  takes  the  input  stream  from  the  signal  input.  Task  3  can  forward  a 
copy  of  its  input  to  task  2,  and  task  2  can  forward  a  copy  of  its  input  to  task  1.  Task  4  accumulates  the  output 
streams  of  tasks  1,  2,  and  3,  and  outputs  fcoOo  -f-  fc+iOi,fc_iOo  +  fcoOi  +  fc+i02,A:_i0i  -I-  fco02  -I-  fc+103,-  •  •.  To 
synchronize  the  tasks,  task  1  needs  to  output  one  0  before  it  reads  any  input,  and  task  3  must  not  generate 
any  output  on  the  first  input.  The  tasks  and  their  relationship  are  represented  with  the  signal  flow  graph  in 
Figure  10. 

We  can  assign  the  tasks  to  one  cluster,  say  cluster  3.  We  then  assign  task  3  to  VGI  processor  0  (P0_C03), 
task  2  to  VGI  processor  1  (P1_C03),  task  1  to  VGI  processor  2  (P2.C03),  and  task  4  to  VGI  processor  3 
(P3.C03).  We  can  use  the  I/O  processor  of  cluster  (I0-C03)  to  handle  the  output.  We  need  the  I/O  processor 
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a(i+l)  b(i)=k(-  l)a(i-  l)+k(0)a(i)+k(+  l)a(i+ 1) 


Figure  10:  A  signal  flow  graph  for  the  ID  convolution 


of  another  cluster,  say  cluster  4,  to  handle  the  input.  We  also  need  to  assign  input/output  ports  for  each 
connection.  The  processor  and  input/output  port  assignment  is  shown  in  Figure  11. 


I0_C04 


I0_C03 


Input 


Output 


Figure  11:  Assignment  of  processors  and  input /output  ports  for  the  ID  convolution 

Now  we  need  to  find  a  path  connecting  the  output  port  of  the  processor  to  the  input  port  of  the  other 
processor  for  each  communicating  processor  pair.  This  process  can  be  done  with  help  of  the  routing  tool. 
For  the  connection  between  01  of  P0_C03  and  10  of  P1_C03,  the  routing  tool  finds  two  possible  paths  (using 
at  most  4  switches):  L1S4P1_C03  (level-1  switch  4  of  VGI  processor  1  in  cluster  3)  and  L1S14P0_C3  - 
L1S7P1_C03.  We  choose  the  first  path  (which  uses  the  bypass  bus  between  P0_C03  and  P1_C03).  Note  that 
switch  L1S4P1_C03  and  its  associated  bus  cannot  be  used  for  other  connections.  We  can  similarly  choose 
paths  for  other  processor  pairs.  The  paths  are  summarized  in  Table  1. 

The  easiest  process  is  to  write  the  VGI  code  for  each  processor.  The  three  input  ports  10,  II,  and  12 
corresponds  to  the  three  queues  ql,  q2,  and  q3,  respectively.  The  three  output  ports  OO,  01,  and  02 
corresponds  to  the  three  output  buses  cO,  cl,  and  c2,  respectively.  When  the  queues  are  not  for  inputs, 
they  can  be  used  as  registers.  Here  are  the  VGI  code  for  each  task  and  the  VGI  main  program  for  the  ID 
convolution, 

/  ic  :)c  9|c  9|c  3tc  *  9|c  t  )|c  :fc  :|c  :fc  :(e  :|c  ]|c  :4c  :(c  :|c  :|c  :tc  :(e  *  *************  ic  :|c  :(c  }«(****  * 

Program  for  task  1 
Name :  k_ml . s 

Input:  a(0)  a(l)  a(2)  ... 

Output:  0  k(“l)a(0)  k(-l)a(l)  k(-l)a(2)  ... 

:4e  :4e  9|e  :4c  *  *  He  )4<  *  3|e  :|c  :|c  :4e  *  *  4c  3|c  ♦  :4c  *  *  )4e  :4c  3fc  3|c  :4c  *  *  :4e  *  ♦  ♦  34t  *  34c  *  9|c  4c  :|c  :4c  ♦  *  34c  3|c  :4c  :|c  :|c  #  :|c  :|c  ♦  :4c  / 
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Table  1:  The  switch  settings  for  the  ID  convolution 

Connection  Switches 

Output  Input 

OO  (P0.C03)  iOlPTICOS)  L1S4P1-C03 

02  (P0_C03)  10  (P3X03)  L1S16P0_C03  -  L2S3C03  -  L2S0C03  -  L1S7P3_C03 

OO  (Pl-CO^  10  (P2_C03)  L1S4P2_C03 

02  (P1.C03)  II  (P3-C03)  L1S16P1_C03  -  L2S7C03  -  L2S4C04  -  L1S9P3-C03 

02  (P2_C03)  12  (P3_C03)  L1S6P3-C03 

OO  (P3-C03)  II  (I0_C03)  L1S14P3_C03  -  L1S10I0_C03 

OO  (I0_C03)  lO-Pad  (I0-C03)  L1S14I0-C03 

01  (10X04)  10  (P0-C03)  L1S15I0_C04  ~  L2S15C04  -  L2S46C03  -  L2S14C03  -  L1S8P0-C03 

lOJPad  (lO-COiT  I2(IQ-CQ4)  |  L1S12IQX04 


.init  qO  =  q 
.init  ql  =  r 
.init  q2  =  r 

.init  r3  =  16384  /*  k(“l):  0.25  (16384  X  2''-16)  */ 

.init  pc  =  0 

0:  rO  =  xor(ra,  ra) , 

c2  =  ra,  /♦  output  0  to  02  ♦/ 
next  =  1; 

1:  r4  =  or(qO,  qO) ,  /*  input  from  10  */ 
next  =  2; 

2:  rO  =  xor(ra,  ra) ,  clear  accumulator  ♦/ 
next  =  3; 

3:  rO  =  mstartu(ra,  r3,  r4) ,  /*  start  multiplication  */ 
next  =  4; 

4:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  5; 

5:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  6; 

6:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  7; 

7:  rO  =  mstepuCra,  r3,  rc) , 
next  =  8; 

8:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  9; 

9:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  10; 

10:  rO  =  mstepu(ra,  r3,  rc) , 

c2  =  ra,  /*  output  product  to  02  */ 
next  =  1; 

/**:|t:|e:»ej|c**Jtt*JfeJ|c5(e3|e3te5(c:|c5|c*3|e*3|e**3(e**5t«*J|e*****************5|'********** 

Program  for  task  2 
Name :  k_0 . s 
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Input:  a(0)  a(l)  a(2)  ... 

Output:  k(0)a(0)  k(0)a(l)  k(0)a(2)  ... 

]|e  9|e  ***  *********  9|(  ***♦  t  ***♦  ic  }|c  3(c  :ie  :|e  *  :|c  :|e  Jfcfe  *♦**♦*}((♦  3|c  9|e  }|e  9|c  3«c  / 

.init  qO  =  q 

.init  ql  =  r 

.init  q2  =  r 

.init  r3  =  32768  /♦  k(0):  0.5  (32768  X  2''-16)  */ 

.init  pc  =  0 

0:  r4  =  or(qO,  qO) ,  /*  input  from  10  ♦/ 
cO  =  ra,  /*  output  to  00  */ 
next  =  1; 

1:  rO  =  xor(ra,  ra) ,  /*  clear  accumulator  ♦/ 
next  =  2; 

2:  rO  =  mstartu(ra,  r3,  r4) ,  /*  start  multiplication  */ 
next  =  3; 

3:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  4; 

4:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  5; 

5:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  6; 

6:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  7 ; 

7:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  8; 

8:  rO  =  mstepuCra,  r3,  rc) , 
next  =  9; 

9:  rO  =  mstepuCra,  r3,  rc) , 

c2  =  ra,  /*  output  product  to  02  */ 
next  =  0; 


/****************:|t*j|t:|e:(c*******************:|t3|c**************** 

Program  for  task  3 
Name:  k_pl.s 

Input:  a(0)  a(l)  a(2)  ... 

Output:  k(+l)a(l)  k(+l)a(2)  ... 

*********************:|c********************:(c:|c**************/ 


.init  qO  =  q 
. init  ql  =  r 
.init  q2  =  r 


.init  r3  =  16384  /*  k(+l) :  0.25  (16384  X  2^16)  ♦/ 
.init  pc  =  0 

0:  r4  =  or(q0,  qO) ,  /*  input  from  10  */ 
cO  =  ra,  /*  output  to  00  */ 
next  =  1; 


1:  r4  =  or(q0,  qO) ,  /♦  input  from  10  */ 
cO  =  ra,  /*  output  to  00  */ 
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next  =  2; 

2:  rO  =  xor(ra,  ra) ,  /♦  clear  accumulator  */ 
next  =  3; 

3:  rO  =  mstartu(ra,  r3,  r4)  ,  /*  start  multiplication  */ 
next  =  4; 

4:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  5; 

5:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  6; 

6:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  7; 

7:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  8; 

8:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  9; 

9:  rO  =  mstepu(ra,  r3,  rc) , 
next  =  10; 

10:  rO  =  mstepu(ra,  r3,  rc) , 

c2  =  ra,  /*  output  to  01  */ 
next  =  1; 


Program  for  task  4 
Name :  sum .  s 

Input:  0  k(-l)a(0)  k(-l)a(l)  ... 

k(0)a(0)  k(0)a(l)  k(0)a(2)  ... 
k(+l)a(l)  k(+l)a(2)  ... 

Output:  k(0)a(0)+k(+l)a(l)  k(-l)a(0)+k(0)a(l)+k(+l)a(2)  ... 

****3)c*3|cjJc*:)e:|ej(e**:|t3(e***j|e:|t***ic3»c*3|e**3»tJte*:|e**j|c*******3»cJ|t******3tc***3le*/ 

.init  qO  =  q 
.init  ql  =  q 
.init  q2  =  q 

.init  pc  =  0 

0:  rO  =  add(qO,  ql) , 
next  =  1; 

1:  rO  =  add(q2,  ra) , 
cO  =  ra, 
next  =  0; 


/*  add  values  from  10  and  11  */ 

/*  add  value  from  12  */ 

/*  output  sum  to  00  */ 


/♦***jtc***:|c3(c***j(c3|e**3|c***:»eJt«J|e***3»e*********3|c3|e3(c********3|e3|e3|e5|«5K5|«**** 

This  is  the  main  program  that  contains  the  configuration 
information  for  the  ID  convolution. 

Name :  convld . set 

♦  **♦****♦*♦♦*  :<e  j|cjK  ***♦♦  :|e  jK  3(e  *  *3|e  j|e  ))c  *  :|c  J|e  ****  *3»C  / 

#  -  Nanoprocessor  allocation:  - 

P0_C03  k^pl 
P1„C03  k_0 

P2_C03  k_ml 
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P3_C03  sum 

I0.C03 

I0_C04 


#  -  Switch  settings: 


# 

CN 

PN 

MN  IN 

SN 

# 

lO.Pad  — >  12* 

■-I0_C04 

s 

4 

-1 

-1  0 

12 

# 

01’I0_C04 

I0-P0_C03 

s 

4 

-1 

-1  0 

15 

s 

4 

"1 

“1  -1 

14 

s 

3 

-1 

-1  -1 

46 

s 

3 

-1 

-1  -1 

14 

s 

3 

0 

“1  -1 

8 

# 

o 

o 

1 

o 

1 

o 

o 

CO 

1 

1 

V 

I0“P1^C03 

s 

3 

1 

-1  -1 

4 

# 

02-P0_C03  — •> 

I0-P3_C03 

s 

3 

0 

-1  -1 

16 

s 

3 

-1 

-1  -1 

0 

s 

3 

“1 

-1  -1 

3 

s 

3 

3 

“1  -1 

7 

# 

00-Pl_C03  — > 

I0-P2_C03 

s 

3 

1 

“1  -1 

4 

# 

o 

to 

1 

1 

o 

o 

CO 

1 

1 

V 

I1“P3_C03 

s 

3 

1 

-1  -1 

16 

s 

3 

-1 

-1  “1 

4 

s 

3 

“1 

“1  -1 

7 

s 

3 

3 

-1  -1 

9 

# 

02*-P2_C03  -“> 

I2-P3_C03 

s 

3 

“1 

-1  -1 

6 

# 

00-P3^C03  — > 

I1-I0^C03 

s 

3 

3 

-1  -1 

14 

s 

3 

-1 

-1  0 

10 

# 

A 

1 

1 

CO 

o 

o 

1 

o 

M 

1 

O 

O 

lO-Pad 

s 

3 

-1 

-1  0 

14 

/:►******  :|c  3|e  j)c  *  3*c*  ♦♦  j|e  **  )|c  jK  :|c  ^  :|e  ****  *  3|e  **  **  jjc  ♦♦*♦**%  :(c  :|e  */ 

Programming  with  Signal  Flow  Graphs 

Stream  data  processing  algorithms  can  often  be  described  naturally  by  block  diagrams  in  which  computa¬ 
tional  nodes  are  interconnected  by  links  that  represent  data  streams  [3, 13].  The  development  of  stream  data 
algorithms  on  the  VGI  computer  can  be  better  supported  using  signal  flow  graphs. 
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Programming  Model 

The  signal  flow  graph  (SFG)  programming  model  allows  the  programmer  to  represent  any  stream  data  parallel 
algorithm  as  a  directed  graph  with  nodes  denoting  tasks  and  edges  denoting  data  flow.  The  SFG  model 
supports  hierarchical  graph  structure  in  the  sense  that  a  node  of  a  graph  may  be  a  subgraph  representing  a 
sub-algorithm.  In  the  SFG  programming  paradigm,  the  programmer  divides  the  algorithm  into  small  tasks. 
Each  task  takes  inputs  from  some  other  tasks  and/or  the  external  input,  and  generates  outputs  for  other 
tasks  and/or  the  external  output.  If  one  task  outputs  to  another  task,  there  should  be  an  edge  from  the 
node  representing  the  first  task  to  the  node  representing  the  second  task.  Each  task  can  be  independently 
coded  with  a  function  that  transforms  the  input  streams  into  the  desired  output  streams. 

In  the  ID  convolution  example,  the  programmer  only  needs  to  develop  an  SFG  shown  in  Figure  10.  We 
can  abstract  the  SFG  as  a  node  as  shown  in  Figure  12  for  constructing  an  SFG  for  a  2D  convolution. 

a(i) 

k(-l)a(i-l)+k(0)a(i)+k(+l)a(i+l) 

Figure  12:  An  abstract  node  representing  the  ID  convolution 


We  then  implement  the  2D  convolution 

bij  =  fc-i.-iUi-i  j-i  +  fc-i.oOi-i.i  + 

+  A:+i,_iai+ij_i  +  k+iflUi+ij  +  fc+i,+iat+i,j+i 


with  the  SFG  as  shown  in  Figure  13. 


Figure  13:  An  SFG  for  the  2D  convolution 
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SFG  GUI  and  SFG-to-VGI  Mapping 


To  support  the  SFG  programming  paradigm,  we  need  a  graphical  user  interface  (GUI)  for  the  programmer 
to  edit  signal  flow  graphs.  In  addition  to  the  graph  editing  capabilities,  The  GUI  should  provide  a  way  for 
the  programmer  to  specify  the  function  of  each  node  as  an  attribute  of  the  node. 

The  major  problem  with  SFG  programming  is  that  we  need  an  effective  algorithm  to  assign  each  task  to 
a  physical  processor,  to  assign  each  end  of  each  edge  to  a  port  of  the  processor,  and  to  And  switches  to  form 
a  path  for  each  edge.  To  solve  this  difficult  problem,  we  need  to  model  the  VGI  computer  as  a  graph  and 
devise  a  mapping  algorithm  that  map  SFGs  to  the  VGI  graph.  More  research  is  needed  to  accomplish  this. 

Programming  in  Functional  Language  SISAL 

The  SISAL  language  is  a  functional  programming  language  intended  for  high  performance  scientific  com¬ 
puting  on  highly  parallel  computers  [9,  12].  Streams  of  integers  and  streams  of  integer  arrays  are  natural 
representations  for  the  data  processed  in  signal,  image,  and  video  processing  applications  [5].  The  SISAL 
language  naturally  supports  development  of  stream  data  algorithms. 

Programming  Model 

The  SISAL  programming  model  allows  the  programmer  to  specify  naturally  a  stream  processing  algorithm 
as  a  recursive  function  that  defines  a  result  stream  as  the  result  of  concatenating  a  new  element  with  the 
stream  produced  by  a  recursive  application  of  the  function.  In  this  programming  paradigm,  the  programmer 
develops  stream  processing  algorithms  without  dealing  with  individual  small  tasks  as  in  the  SFG  and  machine 
level  programming  models. 

A  stream  is  a  data  type  that  may  be  created  for  any  type.  For  example,  -htype  T2  =  stream  [Tl]-|-  defines 
stream  type  T2  for  type  Tl.  The  values  of  T2  are  streams  (i.e.,  sequences  of  indefinite  length)  of  elements  of 
type  Tl.  Three  basic  operations,  stream^f  irst  (for  first  element  of  a  stream),  stream^rest  (for  the  stream 
of  the  remaining  elements),  and  I  I  (for  concatenating  two  streams),  are  provided  for  stream  manipulation. 
A  stream  of  a  single  element  x  of  type  T  can  be  constructed  by  stream  T  [x] .  Elements  of  a  stream  can 
also  be  accessed  using  subscripts.  As  an  example,  our  ID  convolution  can  be  expressed  in  SISAL  as  follows. 

type  Signal  =  stream  [integer]  ; 
function  convld  (a;  Signal  returns  Signal) 

stream  integer  [  (kml*a[0]+k0*a[l]+kpl*a[2]  ] 

I  I  convld  (stream_rest  (a)) 
end  function 

SISAL  Compiler 

To  support  SISAL  programming,  we  need  an  SISAL  compiler  to  translate  SISAL  programs  into  efficient  VGI 
programs.  One  importrint  issue  in  compiling  SISAL  programs  is  the  recognition  of  recursive  functions  that 
can  be  transformed  into  non-recursive  signal  flow  graphs.  More  research  is  needed  to  construct  an  efficient 
SISAL  compiler  for  the  VGI  computer. 
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Conclusions  and  Future  Work 


The  VGI  parallel  computer  has  been  proposed  by  Srini  and  Rabaey  for  processing  stream  data  present  in 
video,  graphics,  and  image  processing  applications.  It  is  a  hybrid  architecture  with  features  of  von  Neumann 
and  dataflow  architectures.  It  implments  the  static  dataflow  model  with  multithreading  concept.  It  provides 
a  simple  synchronization  mechanism,  for  any  two  communicating  processors.  So  far,  the  research  on  the 
VGI  computer  has  been  focused  on  its  hardware.  However,  the  success  of  the  VGI  computer  lies  in  its 
programmability.  This  report  has  discussed  some  issues  related  to  programming  models  and  tools  on  top  of 
the  VGI  hardware.  More  research  efforts  are  needed  to  have  a  more  user-friendly  environment  for  solving 
stream  data  processing  problems  on  the  VGI  computer. 

First  of  all,  research  is  needed  to  provide  and  improve  several  tools  for  assisting  machine  level  program¬ 
ming.  A  functional  VGI  simulator  has  been  under  development  which  incorporates  the  VGI  assembler  and 
the  routing  assistaint.  The  routing  assistant  should  be  an  interactive  tool  that  displays  all  the  switches  with 
already  taken  paths  clearly  maxked  on  the  screen.  In  addition  to  the  capability  to  suggest  paths  between  an 
output  port  of  a  processor  and  an  input  port  of  another  processor,  it  should  allow  the  programmer  to  choose 
switches  interactively  to  form  a  path  and  to  modify  previously  found  paths.  Furthermore,  a  timing  tool 
is  needed  for  estimating  the  execution  time  of  VGI  programs.  Since  the  VGI  computer  executes  multiple 
threads  communicating  asynchronously,  it  is  hard  for  the  programmer  to  trace  the  execution  to  estimate  the 
execution  time.  The  timing  tool  should  compute  the  latency  (the  time  between  the  first  input  and  the  first 
output)  and  the  per  output  time  (the  time  between  an  output  and  the  next  output). 

Secondly,  research  is  needed  on  modeling  the  VGI  computer  as  a  graph  and  mapping  SFGs  to  the  VGI 
graph  to  support  SFG  programming.  The  processors  can  be  modeled  as  nodes.  The  bus  segments  can  also 
be  modeled  as  nodes.  The  switches  can  be  modeled  as  edges.  Finding  paths  for  SFG  edges  is  then  converted 
to  a  problem  of  finding  disjoint  paths  in  the  VGI  graph. 

Thirdly,  research  is  needed  on  compilation  techniques  for  translating  SISAL  programs  into  SFGs.  There 
are  algorithms  to  recognize  simple  recursive  SISAL  functions  and  translate  them  into  SFGs.  More  research 
should  be  performed  to  handle  more  general  cases. 
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ABSTRACT 

Previous  studies  of  surface  micro-machined  polycrystalline  silicon  MEMS  thermal 
micro-actuators  have  established  that  these  simple  and  compact  devices  can  provide 
deflections  on  the  order  of  10  micrometers  at  CMOS  compatible  drive  voltages.  These 
thermo-mechanical  devices  operate  by  differential  thermal  expansion  caused  by  ohmic 
heating  in  higher  resistance  regions  of  the  double  beam  device.  Because  of  thermal 
conductivity  and  temperature  dependent  resistivity  in  polycrystalline  silicon,  the 
temperature  profile  along  the  “pusher”  section  of  the  beam  is  not  uniform,  and  motion 
simulation  of  the  thermal  actuator  can  be  complex.  This  work  represents  an  initial 
attempt  at  correlation  of  experimental  near-IR  observations  of  thermal  actuator  operation 
to  a  new  commercial  Joule-heating  MEMS  simulation  package  that  integrates  pertinent 
mechanisms  into  a  no  parameter  fit  model. 
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Introduction 


Comtois  et  al  ‘  have  presented  poly-silicon  in-plane  horizontal  double  beam  thermal 
actuators  that  demonstrate  large  deflections  (>10]im)  and  operate  at  CMOS  compatible 
drive  voltages  and  at  current  levels  below  5mA.  This  actuator  concept  has  been  applied 
to  a  wide  variety  of  applications,  including  scanning  and  rotating  micro-mirrors,  Fresnel 
lenses,  movable  gratings,  linear  micro-motors,  and  self-assembly  systems.^’  A 

representative  diagram  of  this  beam  design,  used  both  in  the  experimental  and 
simulations  studies  for  report,  is  indicated  in  Fig.  1.  Differential  ohmic  heating  effects  in 
a  double  beam  structure  drive  these  thermal  actuators  which  consist  of  two  parallel 
beams,  coupled  at  the  free  end,  with  one  beam  significantly  more  electrically  and 
thermally  conductive  than  the  second  beam.  As  the  more  resistive  beam  (with  a 
smaller  cross  section)  heats  up  relative  to  the  attached  less  resistive  beam  (with  a  larger 
cross  section),  the  effect  of  differential  thermal  expansion  causes  the  narrower  beam  to 
push  on  the  wider  beam  at  the  free  end.  The  experimentally  optimized  design  in  Fig.  1 
has  been  used  in  number  of  other  studies,  with  limited  theoretical  evaluation.^’  ^ 

Functional  modeling  of  these  devices  is  complex  because  localized  ohmic  heating 
depends  on  the  temperature  distribution  in  the  expanding  beam  elements.  This 
temperature  distribution  depends  on  a  number  of  criteria,  primarily  the  absolute  thermal 
conductivity  of  the  heated  elements,  and,  to  a  lesser  extent,  the  temperature  dependence 
the  resistance  of  the  doped  poly-crystalline  silicon  elements,  which  heat  up.  Simulation 
studies  in  this  work  utilize  a  new  MemCad  4.0  MEMS  simulation  module  available  from 
Microcosm,  Inc.^  This  “MemECad”  Joule  heating  module,  in  conjunction  with  other  parts 
of  this  commercial  finite  element  design  and  simulation  package,  computes  the  thermal 
and  electrical  potential  field  distributions  resulting  from  an  applied  voltage  or  current 
flow  through  a  resistive  material.  The  localized  Joule  heating  effects  are  coupled  to  the 
MemCad  mechanical  simulation  package  to  determine  overall  displacement  and  stresses 
related  to  thermal  expansion.  This  iterative  approach  accounts  for  the  voltage  potential 
distribution  throughout  the  volume  of  the  double  beam  structure,  as  well  as  the  impact  of 
localized  differential  thermal  conductivity,  which  is  related  to  the  different  geometrical 
cross  sections  in  different  parts  of  the  structure.  The  accumulated  thermal  deformations 


31-3 


of  the  double  beam  structure  are  determined  from  the  thermal  expansion  effects  in 
different  finite  elements  of  the  structure. 

Devices 

The  devices  used  for  the  experimental  observations  were  originally  fabricated  at  the 
MCNC/MUMPS  processing  facility  as  part  of  an  array  of  test  structures  for  related 

g 

work.  Test  chips  as  received  from  MCNC  were  released,  mounted  and  bonded  in  an 
open  package.  A  SEM  micrograph  of  a  typical  device  is  indicated  in  Fig.  2.  The  length 
of  the  2.5|im  wide,  2.0  |a.m  thick,  narrow  “pusher”  beam  is  235|im.  The  nominal 
resistivity  of  the  polycrystalline  silicon  is  10±5  QJ  ,  resulting  in  a  room  temperature 
device  resistivity  of  1300+650i2.  Because  of  this  resistivity  variability,  device  current  is 
a  more  pertinent  independent  variable  in  terms  of  calibrating  thermal  actuator  functions. 

These  devices  were  connected  individually  to  a  stable  variable  voltage  DC  power  supply, 
and  the  current  was  monitored  with  a  digital  milliamp  meter.  Beam  deflection  was 
observed  through  a  microscope  coupled  to  a  PULNK  TM  840  TV  camera,  which  has 
significant  relative  sensitivity  below  ^l.Ofim  wavelength.  Picmres  were  captured  in  a 
bit  map  format.  A  typical  captured  image  of  an  operational  device  is  given  in  Fig.  3, 
which  was  taken  at  a  temperature  just  before  the  luminosity  of  near-IR  radiation  can  be 
observed.  Deflections  can  be  measured  using  the  vernier  structure  at  the  coupled  free  end 
of  the  beam.  Images  were  taken  both  in  bright  optical  field  illumination,  and  in  dim 
optical  illumination,  in  order  to  highlight  the  IR  luminosity  of  the  hot  part  of  the  narrow 
beam.  The  IR  luminosity  as  seen  with  the  PULNIX  camera  is  not  seen  for  low  applied 
voltages,  and  increases  in  intensity  with  voltage.  For  example,  no  IR  luminosity  is  seen 
for  the  device  of  Fig.  3  below  8  volts  applied  to  the  two-anchored  beam-ends.  The  sharp 
focus  position  of  the  microscope  for  bright  optical  illumination  and  sharply  defined  IR 
radiance  are  not  coincident,  most  likely,  because  of  the  wave  length  dependence  of  the 
optics.  Fig.  4  indicates  typical  behavior  of  an  acmator  indicated  in  Fig.  2  for  both  current 
and  displacement.  Precision  displacement  measurements  are  limited  by  the  use  of  vernier 
indicators  and  by  “stiction”  of  the  dimple  spacers  in  contact  with  the  substrate.  These 
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“dimples”  act  as  spacers  to  prevent  contact  of  the  double  beam  structure  with  the 
substrate  surface,  nominally  2(im  below  the  beam. 

Experimental  Results 

The  current  and  displacement  dependence  with  applied  voltage  for  a  typical  actuator  is 
indicated  in  Fig.  4.  The  displacement  measurements  are  based  on  observation  of  the 
vernier  at  the  free  end  of  the  beam,  and  are  also  subject  to  “stiction”  of  the  dimple  spacers 
on  the  substrate.  A  displacement  of  over  12  p.m  is  achieved  at  less  than  5mA,  in 
agreement  with  prior  observations.  ‘  The  current  dependence  with  voltage  is  sub-ohmic, 
most  likely  because  the  resistance  of  the  poly-silicon  declines  with  temperature.  No  IR 
luminosity  is  observed  for  this  device  below  8  volts  applied  to  the  beam  ends,  but  above 
this  value,  IR  luminosity  is  quite  apparent,  even  with  moderate  white  light  optical 
illumination,  as  seen  in  Fig.  5. 

The  temperature  at  the  center  of  the  IR  radiance  is  estimated  to  be  in  the  range  of  850  - 
1000  °K,  while  the  edge  of  the  IR  luminosity  is  estimated  as  800  °K  from  blackbody 
spectral  exitance  considerations.’  Clearly,  the  ohmic  temperature  distribution  along  the 
narrow  beam  is  far  from  uniform,  and  the  highest  temperature  region — corresponding  to 
the  elements  of  largest  thermal  expansion — extend  no  more  than  30%  of  the  length  of  this 
beam. 

The  wide,  cold  side  of  the  coupled  beam  stmcture  has  a  flexure  section  40|xm  long  with  a 
2.0x2.0  |i,m^  cross  section  that  serves  as  a  hinge.  Without  this  flexure  element,  little 
thermally  activated  deformation  would  be  seen  at  the  free  end  of  the  coupled  beam 
structure.  Because  the  area  cross  section  of  this  flexure  region  is  20  pet.  smaller  than  that 
of  the  “hot”  beam,  there  could  be  a  concern  that  Joule  heating  in  the  flexure  will  mitigate 
the  deformation  at  the  free  end  of  the  “hot”  beam.  The  IR  observation  indicates  that  both 
the  large  thermal  conductivity  and  the  small  voltage  drop  in  the  short  flexure  is  sufficient 
to  prevent  the  appearance  of  IR  luminosity  in  this  part  of  the  structure.  This  observation 
has  also  been  confirmed  by  the  simulation  studies. 
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The  range  of  IR  luminosity  never  extends  more  than  50  g,m  on  the  235  |im  long  pusher 
beam,  and  never  appears  on  the  40  pm  flexure.  Because  of  thermal  heat  flow  effects,  one 
might  presume  that  the  IR  luminescence  should  be  localized  about  the  center  of  the 
pusher  beam,  but  the  actual  position  varies  from  device  to  device,  even  on  the  same  chip. 
One  possibility  is  that  local  variations  of  the  effective  doping  in  the  pusher  beam  could 
shift  the  “hot  spot.”  Another  source  for  this  phenomena  could  be  local  etch  fabrication 
variations  in  the  cross  section  of  the  pusher  beam. 

MemCad  Simulations 

MemCad  is  an  integrated  set  of  MEMS  simulation  tools  enabling  the  design, 
specification,  modeling,  and  engineering  of  accurate  and  performance  predictive  devices 
and  systems  with  a  wide  variety  of  functional  properties  and  applications.  The  complete 
suite  of  MemCad  modules,  used  in  this  work,  resides  on  a  HP  workstation  at  the  AFRL 
Information  Technology  Division  at  Griffith  AFB,  Rome,  NY,  and  is  accessed  remotely 
at  Hanscom  AFB.  The  recent  availability  of  an  integrated  Joule  heating  module  provided 
an  opportunity  to  compare  experimental  observations  of  local  heating  and  thermal 
conductivity  for  thermal  actuators  to  the  new  simulation  code,  without  the  need  to  assume 
or  infer  a  temperature  profile  on  the  narrow  “pusher”  beam. 

Recently,  Allen  et  al.'”  have  modeled  a  very  similar  ohmic  heating  structure  using  the 
competitive  IntelliCad  simulation  program,"  getting  very  good  correlation  to  prototype 
experimental  devices.  Because  the  code  used  by  Allen  et  al.  did  not  have  a  Joule  heating 
module  at  the  time  of  their  work,  a  temperature  profile  of  the  narrow  beam  had  to  be 
devised.  In  the  current  work,  using  the  MemCad  MemETherm  module,  there  are  no 
fitable  parameters,  other  than  specification  of  poly-silicon  properties,  including  the  poly¬ 
silicon  resistivity,  and  the  voltages  applied  to  fixed  ends  of  the  double  beam  structure. 
Because  of  the  variability  in  Poly-Si  resistivity,  device  current  is,  generally,  a  more 
pertinent  independent  variable  than  applied  voltage. 
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Simulation  Results 


Simulated  results  using  MemCad  are  indicated  in  Fig  6.  Because  MemCad  does  not 
incorporate  a  model  for  material  plastic  flow  or  melting,  calculations  in  the  range  of 
temperatures  above  1 500  “K  are  not  meaningful.  As  clearly  evidenced  in  the  graph,  the 
simulated  displacement  and  reaction  force  at  the  anchors  increases  in  a  nominally  linear 
manner  with  temperature.  These  calculations  also  demonstrate  that  only  the  middle 
sections  of  the  pusher  beam  get  “hot,”  and  that  the  free  end  of  the  double  beam  structure 
gets  no  hotter  than  400  “K,  although  the  center  of  the  pusher  beam  can  reach  1500"K 
with  only  3.5  volts  applied  to  the  fixed  ends  of  the  beams,  as  indicated  in  Fig.  6.  The 
simulated  calculation  also  indicates  that  virtually  all  the  voltage  drop  is  across  the  pusher 
bream,  and  only  10  pet.  across  the  flexure.  Since,  in  addition,  both  ends  of  the  flexure  are 
constrained  to  close  to  room  temperature,  no  IR  “hot  spot”  can  develop  in  this  part  of  the 
structure. 

Summary 

Using  Near-IR  microscopy  on  MEMS  Thermal  Actuators,  reasonable  agreement  between 
experiment  and  a  new  Joule-heating  finite  element  simulation  module  has  been 
established.  This  work  demonstrates  the  need  to  develop  optimized  design  criteria  in  this 
class  of  devices  to  mitigate  the  thermal  conduction  effects  in  the  pusher  beam,  which 
degrades  displacement  performance.  This  work  also  indicates  the  need  to  incorporate  the 
temperature  variation  of  material  resistance  in  the  calculations  to  enhance  accuracy  of  the 
overall  model. 
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Anchor  - 


Direction  of  motion 


Figure  I.  Basic  layout  of  the  lateral  thermal  actuator.  Typical  dimensions  are:  ‘hot’  arm  2.5  pm  wide,  240  tim  long;  ‘cold’ 
arm  16  pm  wide.  200  pm  long;  flexure  2.5  pm  wide,  40  pm  long,  and  gap  2  pm  wide.  The  4  pm  square  dimples  prevent 
stiction.  Deflections  of  16  pm  can  be  achieved  at  3  V  and  3.5  mA  in  the  2  pm  thick  first  polysilicon  layer  of  the  MUMPS 
process. 
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Fig.  2  SEM  Micrograph  of  a  lateral  thermal  actuator.  The  fixed  vernier  grating 
has  a  period  of  4)im,  while  the  vernier  at  the  free  end  of  the  double  beam  has  a 
period  of  5pm.  The  double  beam  structure  is  2pm  thick  doped  poly-Si,  and  is 
suspended  2pm  above  the  substrate. 
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Fig.  3.  Image  of  typical  device  on  test  chip  with  voltage  applied  to  anchor  pads. 
For  this  device  no  IR  luminosity  is  seen  for  applied  voltages  below  8  volts. 
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A  13  Thermal  Actuator 
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Applied  Volts  DC 


Fig.  5  Image  of  same  device  indicated  in  Fig.  3  with  sufficient  voltage  applied  to 
indicate  near-IR  luminosity. 


Actuator  A1 3,  8.75  Volts,  4.782  mA 
Near  IR  luminosity  apparent.  Displacement:  13  microns 


Thermal  Actuator  Characterisitics 
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Amherst,  New  York  14260 
Telephone;  (716)  645-2422,  extension  2138 
Email:  msoum@eng.bufFalo.edu 

ABSTRACT 

This  paper  is  concerned  with  multidimensional  signal  processing  and  image  formation 
with  FOliage  PENetrating  (FOPEN)  airborne  imaging  radar  data  which  were  collected  by 
a  Navy  P-3  ultra  wideband  (UWB)  radar  in  1995  [Raw].  A  commonly-used  assumption 
for  the  processing  of  the  P-3  data  is  that  the  beamwidth  angle  of  the  radar  is  limited  to 
35  degrees  [Bes],  [Goo];  based  on  this  assumption,  the  PRF  of  the  P-3  SAR  system  yields 
alias-free  data  in  the  slow-time  Doppler  domain.  However,  controlled  measurements  with 
the  P-3  radar  have  indicated  a  beamwidth  which  exceeds  35  degrees  [Raw];  e.g.,  at  the 
radar  frequency  of  300  MHz,  the  beamwidth  angle  is  close  to  100  degrees. 

In  this  paper,  we  examine  a  method  for  processing  of  the  P-3  data  in  which  the 
incorrect  assumption  that  its  radar  beamwidth  angle  is  limited  to  35  degrees  is  not  imposed. 
In  this  approach,  a  SAR  processing  scheme  which  enables  the  user  to  extract  the  SAR 
signature  of  a  specific  target  area  (digital  spotlighting)  is  used  to  ensure  that  the  resultant 
reconstructed  SAR  image  is  not  aliased  [S94],  [S95],  [S99].  The  images  which  are  formed 
via  this  method  with  8192  pulses  are  shown  to  be  superior  in  quality  to  the  images  which 
are  formed  via  the  conventional  P-3  processor  with  16386  pulses  which  was  developed  at 
the  MIT  Lincoln  Laboratory  [Bes]. 

In  the  presentation,  we  also  introduce  a  method  for  converting  the  P-3  deramped 
data  into  its  alias-free  baseband  echoed  data,  and  the  resultant  interpretation  of  Radio 
Frequency  Interference  (RFI)  in  the  measured  P-3  data.  A  two-dimensional  spectral  do¬ 
main  method  for  calibration  from  an  in-scene  target  signature  is  discussed.  Squint-mode 
image  formation  with  the  P-3  data  is  also  examined.  The  original  figures  for  this  report 
may  be  obtained  by  contacting  the  author. 


32-2 


Alias-Free  Processing  of  P-3  SAR  Data 


Mehrdad  Soumekh 

1.  INTRODUCTION 

Synthetic  Aperture  Radar  (SAR)  is  an  airborne  or  satellite-borne  radar  system  which 
provides  high- resolution  maps  of  remote  targets  on  a  terrain,  a  planet,  etc.  The  AWACS 
airplanes,  which  are  used  extensively  in  reconnaissance  missions  in  the  Persian  Gulf  re¬ 
gion,  and  the  NASA  space  shuttles  are  equipped  with  this  radar  system.  SAR  systems 
are  a  highly  developed  combination  of  precision  hardware  and  electronic  design  for  data 
acquisition,  and  advanced  theoretical  principles  of  mathematics  and  physics  to  convert  the 
acquired  data  to  high-resolution  images. 

National  security  and  safety  issues,  such  as  the  dangers  from  Scud  missiles  encountered 
in  the  Gulf  War  and  drug  interdiction  problems,  created  the  need  for  developing  FOliage 
PENetrating  (FOPEN)  SAR  systems  that  utilize  Ultra  WideBand  (UWB)  UHF/VHF 
radars  for  detection  of  concealed  targets.  The  SAR  digital  signal  processing  issues  as¬ 
sociated  with  these  FOPEN  reconnaissance  SAR  systems  brought  new  complexities  and 
misunderstandings  for  those  familiar  with  the  traditional  SAR  systems.  This  is  mainly 
due  to  the  wide-bandwidth  and  wide-beamwidth  of  these  SAR  systems  which  are  desired 
to  operate  in  near  range  (few  hundred  meters)  as  well  as  far  range  (several  kilometers). 

One  of  the  first  set  of  FOPEN  SAR  data  were  collected  at  Stanford  Research  Institute 
(SRI).  A  stripmap  pulsed-type  SAR  was  used  in  the  frequency  band  of  [200,400]  MHz  to 
image  a  target  area  with  a  near  range  value  of  approximately  500  meters.  The  imaging 
results  indicated  interesting  features  and  capabilities  of  a  FOPEN  SAR  system  [S95].  How¬ 
ever,  to  accurately  retrieve  the  information  in  the  SRI  data  and  avoid  Doppler  aliasing, 
the  user  should  utilize  new  signal  processing  tools  for  analyzing  this  SAR  database,  and 
forming  images  from  it. 

Imaging  with  FOPEN  SAR  data  requires  the  use  of  a  new  SAR  imaging  method 
(for  example  see  [Caf],  [Car],  [S90],  [S91],  [S92],  [S94],  [S95],  [S99])  which  is  based  on  the 
Gabor’s  theory  of  wavefront  reconstruction  [Gab].  SAR  wavefront  reconstruction  theory  not 
only  provides  a  tool  for  SAR  image  formation  but  also  reveals  functional  properties  of  the 
SAR  signal  which  contradict  or  were  not  predicted  in  the  approximation-based  theoretical 
foundation  of  either  range-Doppler  imaging  or  polar  format  processing  [S95],  [S99]. 

For  instance,  the  beamwidth  angle  of  the  SRI  radar  was  in  excess  of  90  degrees.  It 
can  be  shown  that  the  Pulse  Repetition  Frequency  (PRF)  which  was  used  for  the  SRI 
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data  collection  was  not  sufficient  to  accommodate  such  a  large  beamwidth  angle;  as  a 
result,  the  SRI  data  contained  slow-time  Doppler  aliasing.  The  wavefront  reconstruction 
theory  reveals  unique  functional  properties  of  and  redundancies  in  the  SAR  signal  which 
cannot  be  found  in  any  other  information  processing  modality.  These  redundancies  could 
be  exploited  to  recover  uncorrupted  SAR  data  from  the  collected  aliased  SRI  data. 

While  slow-time  Doppler  aliasing  is  an  issue  in  the  processing  of  the  SRPs  FOPEN 
data,  however,  most  of  the  users  of  the  SRI  data,  who  had  background  in  either  range- 
Doppler  imaging  or  polar  format  processing,  reported  problems  which  stem  from  incorrect 
implementation  of  the  SAR  wavefront  reconstruction  algorithm.  It  turns  out  that  an  ex¬ 
tensive  theoretical  and  practical  knowledge  base  has  been  shaped  by  the  inherent  approxi¬ 
mations  in  the  range-Doppler  imaging  and  polar  format  processing  methods  over  the  past 
forty  years;  some  of  these  concepts  are  either  incorrect  or  not  applicable  when  viewed  in 
the  framework  of  the  modern  high-resolution  or  FOPEN  SAR  systems,  and  the  wavefront 
reconstruction  theory. 

A  more  extensive  FOPEN  stripmap  SAR  data  collection  was  carried  out  in  1995  [Raw] 
with  a  Navy  P-3  data  by  the  Environmental  Research  Institute  of  Michigan  (ERIM);  the 
resultant  database  is  referred  to  as  P-3  data.  The  P-3  SAR  system  utilized  a  chirp  radar 
within  the  frequency  band  of  [215,730]  MHz  with  a  near  range  value  of  approximately  5400 
meters.  The  P-3  data  posed  similar  challenges  and  problems  as  the  SRI  did  for  the  user. 

The  P-3  radar  also  possessed  a  relatively  wide  beamwidth  angle  [Raw].  Similar  to 
the  SRI  data,  the  PRF  for  the  P-3  radar  was  not  sufficient  enough  to  prevent  slow-time 
Doppler  aliasing  within  the  large  beamwidth  angle  of  the  radar.  (The  wide  beamwidth 
angle  for  the  P-3  data  also  posed  other  practical  problems,  for  example,  processing  a 
relatively  large  number  of  slow-time  samples.)  The  preliminary  processing  of  the  P-3  data, 
which  was  done  at  ERIM,  was  based  on  processing  a  31.7  degree  beamwidth  angle.  This 
was  rationalized  based  on  the  following  [Goo]: 

For  UWB/WB  SAR  systems  which  operate  at  low  frequencies,  the  above  ('theoreti¬ 
cal  j  definition  (for  the  radar  beamwidth  angle j  leads  to  an  impractical  wide  coherent 
integration  angle.  As  an  example,  the  PS  UWB  SAR  would  require  a  78. S  degree  in¬ 
tegration  angle  based  on  this  definition.  Instead,  we  suggest  a  working  definition  of 
azimuth  resolution  based  on  the  center  wavelength.  This  definition  yields  a  more  prac¬ 
tical  integration  angle  of  31.7  degrees  for  the  PS  UWB  system  at  a  center  frequency 
of  469.5  MHz. 
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The  final  processing  of  the  P-3  data  was  done  by  a  group  at  the  MIT  Lincoln  Labo¬ 
ratory  (MIT-LL)  [Bes].  The  P-3  processor  which  was  implemented  at  the  MIT-LL  mostly 
utilized  the  recommendations  and  procedures  which  were  suggested  by  the  ERIM  group; 
for  instance,  the  processing  was  performed  on  a  35  degree  beamwidth  angle.  We  will 
show  later  that  the  MIT-LL  P-3  processor  suffered  from  slow-time  Doppler  aliasing  de¬ 
spite  limiting  the  beamwidth  angle  to  35  degrees  (Section  4).  Moreover,  there  existed  other 
problems  with  the  MIT-LL  P-3  processor  which  introduced  other  forms  of  artifacts  in  the 
reconstructed  SAR  image;  these  will  be  discussed. 

Unfortunately,  the  other  P-3  processors  which  are  found  in  the  literature  suffer  from 
various  misunderstandings,  misconceptions,  and  incorrect  implementation  of  the  SAR 
wavefront  reconstruction.  For  instance,  in  [Kir],  the  authors  mention  a  problem  with 
the  wavefront  reconstruction  which  does  not  exist.  Then,  to  solve  the  problem,  they  in¬ 
troduce  an  approximation  which  is  attributed  to  the  wavefront  reconstruction  though  the 
approximation  seems  to  be  the  one  which  is  used  in  the  polar  format  processing.  It  is  likely 
that  the  problem  which  is  observed  by  these  authors  in  the  P-3  wavefront  reconstruction 
is  due  to  the  use  of  an  incorrect  reference  fast-time  point  [S99],  that  is,  an  error  in  the 
implementation  of  the  algorithm. 

Recently,  the  use  of  the  direct  Time  Domain  Correlation  (TDC)  or  backprojection 
algorithm  for  SAR  image  formation  has  become  popular  [Bar],  [Cur],  [S99].  The  backpro¬ 
jection  algorithm  is  a  relatively  simple  approach  for  forming  accurate  SAR  images  though 
it  carries  a  relatively  heavy  computational  burden.  We  should  point  out  that  irrespective 
of  the  algorithm  used  for  image  formation  (for  example,  wavefront  or  backprojection),  an 
aliased  SAR  database  results  in  artifacts  and  loss  of  contrast  in  the  reconstructed  image. 
Examples  of  this  phenomenon  with  the  backprojection  reconstructions  with  the  SRI  data 
are  provided  in  [S99]. 

This  report  provides  a  wavefront  reconstruction  theory  framework  for  alias-free  signal 
processing  of  P-3  data.  The  scheme  is  based  on  what  is  referred  to  as  digital  spotlighting  of 
stripmap  SAR  data  [S95],  [S99];  this  method  was  also  utilized  for  the  alias-free  processing 
of  the  SRI  data  in  [S99].  The  resultant  digitally-spotlighted  P-3  data  are  then  used  to 
form  the  image  of  the  target  scene  via  the  wavefront  reconstruction  algorithm. 

The  processing  is  also  based  on  converting  the  deramped  P-3  data  into  alias-free 
baseband  echoed  data  of  the  target  area  [S99].  This  operation  has  two  desirable  features. 
First,  the  deramped  data  suffers  from  what  is  called  the  Residual  Video  Phase  (RVP)  error 
which  is  commonly  removed  using  certain  approximations  [Car];  baseband  echoed  data  do 
not  contain  RVP  error. 
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In  addition,  FOPEN  SAR  frequency  band  usually  covers  a  part  of  the  radio,  television 
and/or  communication  channels;  these  produce  what  is  referred  to  as  Radio  Frequency 
Interference  (RFI)  for  FOPEN  SAR  systems.  These  RFI  sources  are  fairly  narrowband 
and  commonly  visible  in  the  baseband  echoed  data.  However,  the  deramping  operation 
alters  the  spectral  nature  of  the  RFI  sources  (band  broadening)  which  makes  it  difficult 
to  excise  the  RFI  from  the  desired  SAR  data.  In  fact,  as  we  will  see  in  Section  4,  the  RFI 
suppression  which  is  performed  in  the  MIT-LL  P-3  processor  results  in  degradations  in  the 
reconstructed  image. 

We  will  show  that  the  baseband  echoed  data,  which  are  restored  from  the  deramped 
P-3  data,  provide  a  two-dimensional  fast-time  and  slow-time  spectral  domain  in  which  the 
relatively  strong  direct  path  RFI  sources  appear  as  isolated  signatures  in  the  fast-time 
frequency  and  Doppler  (aspect  angle)  domains  (Sections  3  and  4).  The  user  can  utilize  a 
two-dimensional  nulling  scheme  in  this  domain  to  reduce  the  RFI  effects  with  a  minimal 
amount  of  distortion  in  the  desired  SAR  data. 

We  will  examine  issues  which  are  associated  with  calibration  of  the  P-3  data  with 
respect  to  the  phase  functions  of  the  radar  radiation  pattern  and  the  radar  signal  spectrum. 
A  two-dimensional  calibration  phase  function  in  the  spectral  domain  is  developed  using 
the  signature  of  an  in-scene  target  (Section  5). 

The  MIT-LL  processor  is  also  known  to  yield  artifacts  at  the  azimuth  points  which  are 
far  from  the  aimpoint  (that  is,  an  image  formation  which  is  referred  to  as  squint-mode  SAR 
processing  [Car],  [S92],  [S94],  [S95],  [S99]).  While  these  degradations  have  been  attributed 
to  the  errors  in  the  GPS  motion  compensation  [Bes],  we  will  show  that  these  are  byproducts 
of  processing  aliased  data  (Section  4).  We  will  examine  alias-free  processing  of  the  P-3 
data  in  the  squint-mode,  and  provide  reconstruction  results  for  this  SAR  modality  using 
P-3  data  (Section  6). 

We  begin  with  an  analysis  of  stripmap  SAR  system,  and  the  P-3  SAR  system  data 
collection  (Section  2). 


2.  STRIPMAP  SAR  SYSTEM 

A.  Narrow-Bandwidth  and  Narrow-Beamwidth  Model 

Consider  the  target  region  within  the  radar  range  swath  which  is  composed  of  a  set  of 
point  reflectors  with  reflectivity  <7^  that  are  located  at  the  coordinates  (a;f,y£ )(£  =  !,  2,...) 
in  the  spatial  (a;,  y)  domain.  (A  discrete  target  model  is  used  for  notational  simplicity;  the 
principles  which  follow  are  also  applicable  for  a  continuous  target  region.)  The  variable  x 
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is  used  for  the  range  domain  (also  slant  range  or  down  range),  and  the  variable  y  identifies 
the  cross-range  (also  called  azimuth  or  along  the  track). 

The  synthetic  aperture  domain  is  identified  by  the  variable 


u  =  Vr  T, 


where  Vr  is  the  speed  of  the  radar-carrying  aircraft  in  the  along-track  (azimuth)  domain, 
and  r  represents  the  slow-time  domain.  The  sample  spacing  in  the  slow-time  domain  is 
dictated  by  the  PRF  of  the  radar  system; 

Ar  =  — — ; 

^  PRF’ 

thus,  the  sample  spacing  in  the  synthetic  aperture  domain  is 


A 


U 


Vr 

PRF’ 


The  mean  range  swath  of  the  radar  is  denoted  with  Xc,  see  Figure  1.  Let  the  length 
of  the  radar  swath  in  the  range  domain  be  2Xo.  Thus,  the  range  values  of  the  targets  in 
the  illuminated  target  area  satisfy 

xt  G  [Ac  —  Ao,  Ac  Ao] . 

(In  the  case  of  P-3  data,  Ac  ~  5871.2  m,  and  Ao  ~  483.7  m.)  In  the  conventional  processor 
for  a  narrow-bandwidth  and  narrow-beamwidth  stripmap  SAR  system,  the  user  identifies 
the  azimuth  region  for  image  formation 

y£€  [-lo,lo]. 


Under  the  assumption  that  the  transmitting/receiving  radar  has  a  narrow-bandwidth 
and  a  narrow-beamwidth,  a  constant  beamwidth  angle  (or  beamwidth  extent  ±R;  see 
Figure  1)  is  used  for  slow-time  coherent  integration  of  the  SAR  data  at  all  the  desired 
range  bins  within  the  radar  range  swath.  This  corresponds  to  a  synthetic  aperture  interval 
of  (see  Figure  1) 

u  G  [— L,  L], 


where 


L  =  Fo  +  5. 
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If  the  frequency  domain  methods  are  used  for  this  processing,  the  user  should  make  sure 
that  circular  convolution  aliasing  is  avoided.  (This  is  a  spatial  domain-based  aliasing  which 
is  different  from  the  slow- time  Doppler  domain  aliasing.) 

B.  Wide-Bandwidth  and  Wide-Beamwidth  P-3  Model 

The  wide-bandwidth  and  wide-beamwidth  of  the  P-3  SAR  data  poses  new  challenges 
which  are  not  predicted  in  the  conventional  stripmap  SAR  processors.  Figures  2a-b  show 
examples  of  the  beamwidth  angle  of  the  P-3  radar  versus  the  radar  frequency  [Raw].  These 
figures  show  that  the  P-3  radar  beamwidth  angle  is  within  40  to  110  degrees  in  the  radiated 
chirp  radar  signal  ([215,730]  MHz). 

However,  the  results  which  are  shown  in  Figures  2a-b  represent  the  3  dB  beamwidth 
of  the  radar.  For  alias-free  processing  of  the  SAR  data,  the  user  should  also  incorporate 
the  first  side  lobe  of  the  radar  in  calculating  the  radar  beamwidth  [S99].  We  should  also 
point  out  that  a  6:1  tapped  delayed  line  (lowpass  Doppler  filter)  in  the  slow-time  domain 
was  used  to  reduce  the  beamwidth  of  the  P-3  radar  [Bes],  [Raw].  As  we  will  show  in  our 
results,  this  lowpass  Doppler  filter  was  not  successful  in  sufficiently  reducing  the  beamwidth 
to  prevent  slow-time  Doppler  alia.sing.  In  fact  the  minimum  beamwidth  angle  for  the  P-3 
radar  after  lowpass  Doppler  filtering  turns  out  to  be  over  50  degrees  in  the  [215-550]  MHZ 
radar  frequency  band;  this  will  be  shown  with  the  real  P-3  data. 

Consider  the  P-3  data  over  N  =  16384  slow-time  samples;  this  approximately  corre¬ 
sponds  to  a  35-degree  beamwidth  angle  for  the  center  of  target  area  (called  aimpoint  in 
[Bes]).  With  the  sample  spacing  of 

A„  =  .4  m, 

in  the  synthetic  aperture  domain,  the  half-size  of  the  processed  synthetic  aperture  is 

.  N  Au 

L  =  =  3276.8  m. 

Depending  on  the  choice  of  Yq  ,  the  user  faces  different  constraints  for  alias-free  processing 
of  the  P-3  data. 

An  example  is  shown  in  Figure  3a.  In  this  scenario,  the  desired  target  area  is  within  the 
radar  radiation  pattern  for  all  the  available  radar  frequencies,  from  the  SAR  signal  theory 
point  of  view,  the  resultant  database  should  treated  as  a  spotlight  SAR  data.  (Examples 
of  this  for  the  SRI  data  are  provided  in  [S99].)  In  this  case,  the  sampling  in  the  synthetic 
aperture  domain  should  satisfy  [S92],  [S94],  [S99]: 

A  <  _ _ 

4  sin  Umax 
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where  Amin 
MHz),  and 


is  the  wavelength  at  the  highest  frequency  of  the  radar  (Amin 


^max  =  arctan  ), 

c  -^0 


=  .41  m  at  730 


is  the  maximum  aspect  angle  of  the  target  area  as  seen  by  the  radar  within  the  synthetic 
aperture  interval  of  u  €  [—L,L]]  see  Figure  3b. 


C.  MIT  Lincoln  Laboratory  P-3  Processor 

The  MIT-LL  P-3  processor  is  based  on  coherent  integration  of  16384  slow-time  pulses 
over  4096  fast-time  samples.  The  resultant  SAR  image  has  Ux  =  4096  range  bins  with 
sample  spacing  of  =  .236  m;  this  is  based  on  the  half-range  swath  of 

Ai-  Tlr 

Xo  =  =  483.7  m. 

(Due  to  the  curvature  of  the  radar  radiation  pattern  in  the  FOPEN  SAR  systems,  a  finer 
range  sample  spacing  should  be  used  [S94],  [S95],  [S99].) 

The  SAR  image  of  the  MIT-LL  processor  produces  16384  cross-range  bins  with  sample 
spacing  of  Aj,  =  A^  =  .4  m.  However,  only  the  middle  Uy  =  4096  cross-range  bins 
are  retained,  and  the  remaining  75  percent  of  the  cross-range  bins  are  considered  to  be 
contaminated  (circular  convolution  aliasing)  and,  thus,  are  discarded.  In  this  case,  the 
half- width  of  the  formed  image  in  the  cross-range  domain  is 

Aw  n„ 

Yo  =  =  819.2  m. 


3.  RADIO  FREQUENCY  INTERFERENCE  SUPPRESSION 
VIA  CONVERSION  OF  DERAMPED  SIGNAL 
TO  ECHOED  SIGNAL 

The  FOPEN  SAR  systems  operate  in  the  VHF  and  UHF  frequency  bands  in  which 
various  television  and  communication  channels  reside.  These  sources,  which  are  called 
Radio  Frequency  Interference  (RFI),  contaminate  the  collected  SAR  data.  (There  are 
other  sources  of  RFI,  e.g.,  Electronic  Counter  Measure,  ECM,  signals;  for  a  discussion  and 
treatment  of  ECM  signals  in  SAR  see  [S99]). 

Figure  4a  shows  the  measured  deramped  P-3  data  for  four  consecutive  of  the  radar 
bursts  versus  the  chirp  radar  instantaneous  frequency.  The  resultant  data  can  be  inter¬ 
preted  as  the  SAR  signal  in  the  fast-time  frequency  u>  domain  [S99].  There  are  certain 
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inaccuracies  which  are  involved  in  this  assumption  that  are  called  Residual  Video  Phase 
(RVP).  One  can  use  an  approximation  to  reduce  the  RVP  error  [Car].  The  MIT-LL  P-3 
processor  utilizes  this  scheme  [Besj. 

The  main  problem  with  this  processing  is  not  the  approximation  which  is  used  to 
remove  the  RVP  error.  It  turns  out  that  the  deramping  of  the  measured  signal  results  in 
spectral  spreading  of  the  RFI  signals  in  the  fast-time  frequency  domain.  Figure  5a  shows 
the  two-dimensional  spectrum  (i.e.,  fast-time  frequency  and  slow-time  Doppler  frequency) 
for  a  portion  of  the  P-3  data.  The  horizontal  lines  which  appear  in  this  figure  are  the 
spectrally-spread  RFI  signals. 

The  MIT-LL  processor  utilized  a  scheme  for  RFI  suppression  in  the  P-3  data  that 
is  based  on  placing  notches  in  the  range  equalization  filter  at  the  frequency  bin  locations 
that  are  occupied  by  the  strong  RFI  signals.  As  we  will  show  later,  this  would  result  in 
degradations  in  the  Point  Spread  FUnction  (PSF)  of  the  SAR  image,  particularly  in  the 
range  domain.  In  addition  to  degrading  the  PSF,  there  is  no  study  on  the  effect  of  the 
adverse  effects  of  the  spectral  spreading  of  the  RFI  signals  in  the  reconstructed  SAR  image. 

The  processing  which  we  used  to  encounter  RFI  is  based  on  converting  the  deramped 
data  into  the  baseband  echoed  signal  [S99].  Figure  4b  shows  the  converted  echoed  data 
for  the  deramped  signals  in  Figure  4a.  Note  that  the  RFI  signals  appear  as  fairly  isolated 
signatures  (not  spread)  in  Figure  4b;  for  example  the  audio  and  video  signals  for  Channel 
13  appear  around  215  MHz.  The  two-dimensional  spectrum  of  the  echoed  signal  in  Figure 
5b  exhibits  the  same  phenomenon.  Note  that  an  RFI  signal  which  correspond  to  a  direct 
path  propagation  appears  at  specific  slow-time  Doppler  frequency  (or  angle  of  arrival  with 
respect  to  the  synthetic  aperture).  The  components  seen  at  the  other  Doppler  frequencies 
are  due  to  multi-path  effects  as  well  as  slow-time  incoherence  in  the  broadcasted  TV  signal. 

This  representation  of  the  RFI  signals  allows  the  user  to  utilize  a  variety  of  the  array 
signal  processing  methods  for  RFI  suppression.  However,  one  may  also  use  simple  meth¬ 
ods  for  RFI  suppression.  For  instance,  since  the  direct  path  RFI  signals  pose  the  main 
distortion  in  the  reconstructed  SAR  image,  the  user  may  apply  power  equalization  on  the 
spectral  components  of  the  SAR  signal  where  the  direct  path  signal  is  strong. 

Note  that  the  SAR  wavefront  reconstruction  is  based  on  a  mapping  of  the  spectral 
domain  of  the  SAR  signal  into  the  target  function  spectrum;  this  is  known  as  Stolt  trans¬ 
formation.  Thus,  the  user  may  apply  spectral  filtering  for  RFI  suppression  after  forming 
the  SAR  image.  This  allows  the  user  to  determine  whether  such  an  RFI  filtering,  which 
is  likely  to  degrade  the  PSF  of  the  SAR  system,  is  a  necessary  operation.  In  fact,  our 
results  indicate  that  the  RFI  strongest  signal  in  the  baseband  echoed  signal  results  in  a 
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background  noise  which  is  below  60  dB  of  the  strongest  reflector  in  the  scene.  (This  ob¬ 
servation  is  based  on  coherent  slow-time  processing  of  8192  P-3  pulses;  the  relative  signal 
to  RFI  power  ratio  improves  as  the  number  of  pulses  is  increzised.) 

4.  DIGITAL  SPOTLIGHTING  AND  SLOW-TIME 
(PRF)  UPSAMPLING  FOR  ALIAS-FREE 
DOPPLER  PROCESSING 


A.  MIT-LL  P-3  Processor 

As  we  mentioned  earlier,  the  MIT-LL  P-3  processor  froms  an  image  in  the  spatial 
domain  region  identified  via 

(x,y)  e  [Ac  ±Xo,±F)], 

where  Xc  —  5871.2  m,  Aq  =  483.7  m,  and  Yq  —  819.2  m.  This  area  is  formed  by  processing 
16384  pulses  in  the  synthetic  aperture  interval  of 

u  e  [  -  L,  L]  =  [-3276.8, 3276.8]  m. 


Based  on  these  parameters,  the  maximum  aspect  angle  of  the  desired  target  area  with 


respect  to  is 


^max  =  arctan  (  =  37". 

Ac  Ao 


Thus,  at  the  radar  frequencies  where  the  beamwidth  angle  is  less  than  74  degrees 
(±37"),  the  desired  target  are  is  observable  is  to  the  radar.  If  we  use  the  constraint  on  the 
synthetic  aperture  sample  spacing 


A  <  _ _ 

4  sin^max’ 


with  A„  =  .4  m,  and  0ma.x  =  37",  the  following  condition  should  be  satisfied  by  the 
minimum  wavelength: 

Amin  >  -963  m. 


For  a  74-degree  beamwidth,  this  implies  that  the  P-3  data  contain  aliasing  for  radar  fre¬ 
quencies  which  are  greater  than  -  =  312  MHz. 

However,  this  does  not  imply  that  the  data  from  the  targets  outside  the  desired  az¬ 
imuth  region  (that  is,  \y\  >  Fo)  are  not  aliased.  The  aliased  data  from  these  targets  not 
only  results  in  shifted  and  smeared  images  of  these  targets  (which  is  the  result  of  aliasing 
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fold  over  in  their  own  spectral  band)  but  also  noise-like  artifacts  (which  is  the  result  of 
aliasing  fold  over  in  the  spectral  band  of  the  targets  with  \y\  <  Yo)  that  appear  in  the 
image  of  the  desired  target  area.  The  two-dimensional  spectral  data  in  Figures  5a-b  show 
these  Doppler  aliased  data;  see  the  ramp-looking  signatures.  For  instance,  the  signature  of 
a  target  at  an  aspect  angle  of  40  degrees  appears  alias-free  for  radar  frequencies  which  are 
less  than  300  MHz.  (The  40-degree  ramp  is  cut  at  the  highest  slow-time  Doppler  frequency 
ku  —  S  rad/m,  or  333  Hz,  at  around  300  MHz).  The  signature  of  this  target  at  the  higher 
radar  frequencies  is  folded  over,  and  is  aliased. 

As  we  stated  earlier,  the  beamwidth  angle  of  the  P-3  radar  is  likely  to  be  close  to  50 
degrees  at  the  highest  radar  frequency.  The  coverage  of  the  resultant  data  in  the  spatial 
frequency  domain  of  the  target  image  is  shown  in  Figure  6  via  two  solid  lines.  The  two 
dashed  lines  in  this  figure  show  the  target  spectral  region  when  the  radar  beamwidth  is 
assumed  to  be  35  degrees.  The  lightly  shaded  region  represents  the  processed  spectral 
data  which  are  not  aliased.  The  darker  shaded  area  is  the  aliased  spectral  region  of  the 
50-degree  beamwidth  data. 

Although  the  MIT-LL  P-3  processor  applies  a  35-degree  Doppler  filter  on  the  data 
to  remove  slow-time  Doppler  aliasing.  However,  this  filter  cannot  remove  the  entire  fold- 
over  aliasing  data  which  are  shown  in  Figure  6.  Moreover,  the  35-degree  filter  removes  a 
significant  amount  of  alias-free  data  at  the  lower  radar  frequencies  ([215,400]  MHz)  where 
the  beamwidth  angle  is  well  beyond  50  degrees.  It  turns  out  that  most  of  the  energy  of 
a  FOPEN  radar  that  shows  a  significant  return  from  a  foliage  area  for  concealed  target 
detection  is  within  this  lower  frequency  band. 

We  should  point  out  that  for  a  planar  aperture  radar,  or  a  parabolic  radar  with  a 
feed,  or  a  horn-type  radar  (which  is  used  for  collecting  the  P-3  data)  where  the  beamwidth 
is  a  linear  function  of  the  wavelength,  the  slow-time  Doppler  bandwidth  is  approximately 
invariant  in  the  radar  frequency  [S99].  For  instance,  if  the  P-3  radar  beamwidth  angle  is 
approximately  50  degrees  at  730  Mhz,  then  the  target  function  spectral  support  is  within 
the  two  solid  lines  which  are  shown  in  Figure  7.  The  resultant  fold-over  alising  is  shown 
by  the  dark  shaded  area  in  this  figure. 

In  this  report,  we  examine  the  P-3  reconstructions  of  a  target  area  which  is  composed 
of  a  set  of  Corner  Reflectors  (CRs)  on  a  clear  land  (Stakes  1-11),  and  CRs  which  are  in 
a  foliage  region  (Phenomenology  Stakes  1-2).  Figure  8a  shows  the  location  and  the  types 
of  these  targets;  some  of  these  targets  (Stakes  1,  2,  10  and  11)  are  circled  in  the  aerial 
optical  image  in  Figure  8b;  Phenomenology  Stakes  1  and  2  are  identified  by  two  circles  in 
the  aerial  optical  image  of  Figure  8c. 
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Figures  9a-b  show  the  SAR  images  of  this  target  area  which  axe  formed  by  the  MIT- 
LL  processor.  Figure  9a  represents  what  is  referred  to  as  Level  3  reconstruction.  Figure 
9b  corresponds  to  Level  4  reconstruction  in  which  a  calibration  was  performed  to  improve 
the  results.  (The  calibration  processing  will  be  discussed.)  The  aimpoint  which  was  used 
to  form  these  images  was  at  the  center  of  the  CRs  on  the  clear  land. 

Figures  lOa-b  and  lla-b  show  the  close-ups  of  Level  3  and  Level  4  reconstructions  of 
Stake  10  and  Phenomenology  Stake  2;  the  range  and  cross-range  cuts  for  these  targets  are 
shown  in  Figures  10c  and  11c.  (These  slices  were  obtained  from  the  formed  images  which 
were  clipped  within  a  45  dB  power  interval.) 

Figures  12a-b  show  a  portion  of  the  reconstructed  SAR  image  when  the  aimpoint  is 
at  a  squint  angle  with  respect  to  the  CR  area.  (Level  4  reconstructions  are  shown.)  For 
Figure  12a,  the  aimpoint  is  approximately  650  m  from  the  center  of  the  CR  area;  and  the 
aimpoint  is  approximately  -750  m  from  the  center  of  the  CR  area  in  Figure  12b.  Note  that 
all  the  corner  reflectors  exhibit  severe  range  domain  side  lobes  in  their  images;  this  is  due  to 
the  processing  which  was  used  for  RFI  suppression.  In  addition,  there  two  other  artifacts 
which  can  be  seen  in  the  MIT-LL  processed  P-3  images  which  are  caused  by  processing 
Doppler-aliased  data: 

i.  The  images  contain  fold-over  aliasing  from  the  targets  which  are  outside  the  azimuth 
imaging  area  of  [—^0,^0]  =  [—819,819]  m.  This  form  of  Doppler  aliasing  appears  as 
background  (salt  and  pepper)  noise  that  reduces  the  contrast  of  the  image;  these  are 
clearly  visible  in  Level  3  images  of  Figures  9a,  10a,  and  11a.  After  Level  4  processing 
(i.e.,  calibration),  the  background  noise  becomes  weaker.  The  calibration  which  was 
used  for  Level  4  processing  was  a  one-dimensional  gain  (lowpass-type)  and  phase 
Alter  in  the  cross-range  spatial  frequency  domain  [Bes].  The  fact  that  this  filter  was 
invariant  in  range  can  be  observed  by  noting  that  the  range  domain  side  lobes  are  not 
changed  after  the  calibration. 

ii.  The  images  contain  fold-over  aliasing  from  the  targets  which  are  inside  the  azimuth 
imaging  area  of  [— Fo,Fo]  =  [—819,819]  m.  This  form  of  Doppler  aliasing  results  in 
ghost-like  structures  which  are  the  smeared  and  shifted  replicas  of  the  targets  in  the 
scene  [S92],  [S94],  [S99].  These  ghost  artifacts  get  more  severe  as  the  target  moves 
away  in  the  cross-range  domain  from  the  aim  point.  This  is  due  to  the  fact  that  the 
slow-time  Doppler  aliasing  gets  more  severe  as  the  target’s  squint  angle  with  respect 
to  the  center  of  the  synthetic  aperture  (or  its  azimuth  location)  increases  [S92],  [S94], 
[S99].  This  phenomenon  can  be  seen  in  Figures  12a-b  where  a  squint  aimpoint  was 
used  for  image  formation. 
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The  authors  of  the  MIT-LL  processor  had  attributed  the  ghosting  appearance  of 
the  squinted  targets  (that  is,  the  targets  away  from  the  aimpoint)  to  the  procedure 
which  they  used  for  GPS  motion  compensation  [Bes].  It  turns  out  that  the  MIT- 
LL  processor  performs  GPS  motion  compensation  with  respect  to  the  center  of  the 
target  patch  (aimpoint).  This  is  analogous  to  using  polar  format  processing  (plane 
wave  approximation)  which  introduces  errors  (smearing  and  shifting)  for  off-centered 
(squinted)  targets.  However,  due  to  the  large  mean  range  swath  of  the  P-3  radar  and 
the  relatively  small  range  motion  errors  which  were  recorded  by  the  GPS  system  of 
the  P-3  SAR,  the  severe  ghosts  which  appear  in  the  MIT-LL  images  cannot  be  due 
the  GPS  motion  compensation. 

Due  to  the  effect  of  slow-time  Doppler  aliasing  in  the  SAR  image  which  is  formed 
by  the  MIT-LL  processor  (which  is  most  evident  at  the  larger  azimuth  coordinates)  as 
well  as  the  incorrect  processing  of  the  wavefront  reconstruction  which  resulted  in  the 
circular  convolution  spatial  aliasing^  the  authors  of  the  MIT-LL  processor  discarded  about 
75  percent  of  the  azimuth  data  after  image  formation;  that  is,  they  kept  the  middle  4096 
pixels  in  the  azimuth,  and  discarded  the  other  12292  pixels  at  the  azimuth  boundaries 
(6146  on  each  side)  of  the  image.  The  alias-free  wavefront  reconstruction  processing  which 
is  described  next  also  provides  a  tool  (digital  spotlighting)  for  forming  an  error-free  image 
of  an  area  without  circular  convolution  spatial  aliasing  and  discarding  a  large  amount  data. 

B.  Digital-Spotlight  Alias-Free  Wavefront  Reconstruction 

As  we  stated  earlier,  the  echoed  data  in  Figure  5b  are  aliased  in  the  slow-time  Doppler 
domain.  We  have  developed  a  method  to  exploit  certain  redundancies  in  the  SAR  signal 
to  circumvent  this  aliasing  problem  [S92],  [S94],  [S95],  [S99].  The  approach  is  similar  to 
the  processing  which  we  used  to  retrieve  the  echoed  data  in  the  fast-time  domain  from  the 
deramped  data. 

In  this  approach,  the  SAR  data  are  first  deramped  in  the  slow-time  domain;  this  is 
referred  to  as  the  slow-time  compression.  Then,  a  two-dimensional  filtering  method  is  used 
to  extract  the  signature  of  the  desired  target  area  (that  is,  \x  —  Xc\  <  Xq  and  \y\  <  F)) 
from  the  slow-time  compressed  SAR  data;  this  is  called  digital-spotlighting.  Then,  based 
on  the  slow-time  sampling  constraint  of 

A  <  _ _ 

4  sin  ^max 

the  digitally-spotlighted  slow-time  compressed  data  are  upsampled  in  the  slow-time  domain. 
Finally,  the  alias-free  SAR  signature  of  the  desired  target  area  is  formed  by  ramping  the 
upsampled  data. 
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This  scheme  has  been  used  for  processing  of  the  SRI  data  and  Inverse  SAR  (ISAR) 
data  of  airborne  targets  [S94],  [S99].  We  use  this  procedure  for  imaging  a  subpatch  target 
region  with  Xq  =  150  m  and  Yq  =  300  m  at  the  center  of  the  CR  area  of  Figure  8  using  the 
P-3  data.  For  this,  we  only  use  8192  pulses  of  the  P-3  data,  that  is,  one-half  of  the  database 
which  is  used  in  the  MIT-LL  P-3  processor.  There  are  two  reasons  for  this.  First,  for  alias- 
free  processing  of  this  target  area,  the  slow-time  sampling  constraint  becomes  A„  <  .19  m; 
that  is,  we  need  to  upsample  the  P-3  data  by  more  than  a  factor  of  two  that  results  in  more 
than  32000  slow-time  samples.  Due  to  the  memory  limitation  of  the  PC  computer  that  we 
used,  this  was  infeasible.  (We  should  point  out  again  that  for  alias-free  reconstruction  with 
the  backprojection  algorithm,  the  user  has  to  process  the  same  large  upsampled  database.) 

The  other  reason  for  processing  8192  P-3  pulses  is  that  the  resultant  alias-free  recon¬ 
struction  exhibits  target  images  which  axe  superior  to  the  aliased  MIT-LL  reconstructions 
with  16384  pulses.  (In  fact,  the  alias-free  reconstruction  with  4096  P-3  pulses  exhibit 
better  contrast  than  the  MIT-LL  reconstructions  with  16384  pulses.) 

Based  on  the  constraint  A„  <  .  j," — ,  the  8192  P-3  pulses  are  upsampled  (in 

41  Sin  C7rnax 

addition  to  the  digital  spotlighting)  to  yield  11358  pulses  with  sample  spacing  of  A„  = 
.2885  m.  Then,  after  slow-time  Doppler  processing  (that  is,  Fourier  transform  from  the 
synthetic  aperture  u  domain  to  the  ku  domain),  the  data  could  be  down-sampled  to  satisfy 
the  Nyquist  constraint  for  imaging  azimuth  region  |y|  <  To;  this  yields  abut  3800  azimuth 
spatial  frequency  ky  samples. 

After  appropriate  phase  processing  and  interpolation,  the  two-dimensional  target  spec¬ 
trum  in  Figure  13  is  obtained.  Figure  9c  shows  the  resultant  spatial  domain  reconstruction; 
this  image  is  analogous  to  the  uncalibrated  Level  3  MIT-LL  reconstruction  in  Figure  9a. 
Figure  9d  shows  the  alias-free  wavefront  reconstruction  after  calibration.  (The  processing 
for  calibration  will  be  discussed.)  Figures  lOd-e,  respectively,  show  the  uncalibrated  and 
calibrated  alias-free  reconstructions  of  Stake  10;  the  range  and  cross-range  slices  through 
these  images  are  shown  in  Figure  lOf.  (MIT-LL  reconstructions  of  Stake  10  are  shown  in 
Figures  lOa-c.) 

Figures  lld-f  show  similar  alias-free  wavefront  reconstruction  results  for  Phenomenol¬ 
ogy  Stake  2.  (MIT-LL  reconstructions  of  Phenomenology  Stake  2  are  shown  in  Figures 
lla-c.)  Figures  14a-c,  15a-c,  and  16a-c  are  the  alias-free  wavefront  reconstructions  for 
Stakes  2,  3,  and  9.  Note  that  all  the  image  of  the  trihedrals  also  show  a  curved  structure 
behind  the  CR.  This  signature  is  due  to  multipath  effects  inside  a  trihedral  which  results 
in  multiple  delayed  echoes  in  addition  to  the  primary  echoed  signal  form  the  CR.  Un¬ 
like  the  primary  echo,  the  multipath  echoes  are  not  coherent  in  the  slow-time  (synthetic 
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aperture)  domain  to  form  a  focused  image.  These  echoes  correspond  to  a  (non-existent) 
moving  target  in  the  slow-time  domain;  the  curved  structure  is  the  result  of  imaging  such 
an  imaginary  moving  target. 

5.  TWO-DIMENSIONAL  SPECTRAL  DOMAIN 
IN-SCENE  TARGET  CALIBRATION 

SAR  wavefront  reconstruction  theory  shows  that  the  SAR  data  has  to  be  matched- 
filtered  in  the  two-dimensional  fast-time  and  slow-time  spectral  domains  by  appropriate 
phase  functions  of  the  radar  radiation  pattern  and  the  radar  signal  spectrum  [S99].  For 
this,  the  user  should  determine  these  phase  functions  in  a  noise-free  environments;  these 
are  referred  to  as  calibration  measurements. 

Calibration  measurements  for  the  P-3  radar  were  apparently  made.  The  authors  of 
MIT-LL  processor  used  a  scheme  to  exploit  these  measurements  in  conjunction  with  in¬ 
scene  targets  signatures  to  construct  a  one- dimensional  calibration  signal  which  varies 
with  the  azimuth  (cross-range)  spatial  frequency,  but  was  invariant  in  the  range  spatial 
frequency  [Bes]. 

As  we  stated  earlier,  the  calibrated  MIT-LL  P-3  image  weis  called  Level  4  recon¬ 
struction.  The  Level  3  (uncalibrated)  and  Level  4  MIT-LL  reconstructions  were  shown  in 
Figures  9a-b,  and  for  two  individual  targets  in  Figures  lOa-c  and  lla-c.  Note  that  these 
images  show  that  the  point  spread  function  (PSF)  improves  in  the  azimuth  domain  for 
Level  4  reconstruction;  the  same  is  not  true  for  the  range  domain  PSF  (the  range  side 
lobes)  due  to  the  invariance  of  the  one-dimensional  calibration  in  the  range  domain. 

To  determine  the  nature  of  the  calibration  phase,  we  examined  the  spectral  phase 
of  the  CRs  in  the  alias-free  image  of  Figure  9c.  These  are  shown  in  Figures  lOg,  llg, 
14d,  15d,  and  16d  for  Stake  10,  Phenomenology  Stake  2,  Stake  2,  Stake  3,  and  Stake  9, 
respectively.  (To  exhibit  the  phase  information,  the  magnitude  of  the  real  part  of  these 
two-dimensional  spectra  are  shown  in  these  figures;  the  multipath  effects  are  not  included 
to  compute  these  spectra.)  Note  that  these  spectra  all  show  a  phase  (fringe)  pattern  which 
varies  with  both  the  range  spatial  frequency  and  the  azimuth  spatial  frequency  ky. 
(Note  that  the  calibration  signal  for  the  MIT-LL  processor  was  a  one- dimensional  signal 
of  the  azimuth  spatial  frequency  ky.) 

Moreover,  this  phase  pattern  is  approximately  radially  symmetric  in  the  spatial  fre¬ 
quency  domain.  The  latter  observation  indicates  that  the  calibration  phase  is  mainly  due 
to  the  phase  of  the  chirp  radar  spectrum;  that  is,  an  additional  phase  is  modulating  the 
chirp  which  has  to  be  matched  filtered  (removed)  [S99].  In  our  processing,  we  use  the 
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two-dimensional  spectral  signature  of  Stake  10  for  calibration.  The  results  are  shown  in 
Figure  9d,  and  for  the  individual  targets  in  Figures  lOe,  lie,  14b,  15b,  and  16b.  (The 
range  and  cross-range  slices  for  these  targets  appear  as  solid  lines  in  Figures  lOf,  Ilf,  14c, 
15c,  and  16c.) 

6.  SQUINT-MODE  WAVEFRONT  RECONSTRUCTION 

In  Figures  12a-b,  we  showed  the  reconstruction  results  for  the  MIT-LL  processor  when 
the  aimpoint  was  moved  to  650  m  and  -750  m  from  the  center  of  the  CR  area;  these  are 
referred  to  as  squnit  aimpoints.  Due  to  the  processing  of  aliased  data  by  the  MIT-LL 
P-3  processor,  these  squint-mode  images  exhibited  severe  ghost-like  signatures  of  the  CRs. 
The  alias-free  wavefront  reconstruction  of  Section  4  may  also  be  used  for  error-free  mage 
formation  in  squint  mode  [S94],  [S95],  [S99]. 

To  show  this,  consider  a  200  m  by  300  m  target  area  (Xq  =  100  m  and  lo  =  150  m) 
centered  at  the  range  Xc  =  5600  m.  This  area  contains  a  CR  (Stake  1)  and  some  dense 
foliage.  The  broadside  digitally-spotlighted  reconstruction  of  this  area  with  512  P-3  pulses 
is  shown  in  Figure  17a;  for  the  broadside  reconstruction,  the  512  pulses  are  centered  at 
the  azimuth  y  =  0.  Figure  17b  is  the  spectrum  of  this  reconstruction. 

We  should  point  out  that  after  digital  spotlighting,  the  Nyquist  sampling  constraint 
in  the  slow-time  domain  becomes  A„  <  1.42  m  which  is  smaller  than  the  slow-time  sample 
spacing  of  A„  =  .4  m  of  the  measured  P-3  data.  In  this  case,  the  digitally-spotlighted 
P-3  data  is  down-sampled  accordingly  by  our  processor.  (Note  that  the  azimuth  spectral 
support  in  Figure  17b  is  smaller  than  the  slow-time  Doppler  support  of  the  original  P-3 
data  in  Figure  5b.)  Figure  17c  shows  the  spectral  fringe  pattern  (real  part)  of  Stake  1 
signature. 

Next,  we  consider  512  pulses  of  the  P-3  data  which  are  approximately  2500  m  away 
from  the  center  of  the  target  area  in  the  azimuth  domain;  this  corresponds  to  an  approxi¬ 
mately  25  degrees  of  squint  angle.  Figure  18a  shows  the  squint  reconstruction  of  the  target 
region;  Figure  18b  shows  the  reconstruction  for  the  frequency  band  [230,515]  MHz.  It  turns 
out  that  the  P-3  data  at  this  squint  angle  for  the  radar  frequencies  which  are  greater  than 
515  MHz  are  very  weak.  We  believe  that  this  is  due  to  the  6:1  subsampling  slow-time 
filter  that  was  applied  on  the  P-3  data  [Bes],  [Raw].  Figure  18c  shows  the  spectrum  of  the 
reconstructed  image;  Figure  18d  is  the  spectral  fringe  pattern  of  Stake  1. 
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HIGH  VELOCITY  PENETRATION  OF 
LAYERED  GROUT  TARGETS 


Joseph  W.  Tedesco 


Introduction 


In  recent  years  the  penetration  of  concrete  and  grout  targets  by  small-scale  projectiles  has 
received  considerable  attention.  The  motivation  for  this  directed  effort  is  the  trend  in  new 
designs  for  manned  Air  Force  aircraft  which  employ  internal  weapon  carriage.  As  a  result  of  the 
mass  and  volume  constraints  imposed  by  this  requirement,  the  development  of  smaller 
penetrators  for  air  to  surface  munitions  is  being  initiated  by  the  Air  Force. 

A  critical  issue  associated  with  the  deployment  of  smaller  penetrators  (e.g.  250  lb  class  MMT)  is 
that  they  meet  or  exceed  the  performance  criteria  of  the  currently  deployed  2000  lb  penetrator. 

To  achieve  this  goal,  the  smaller  penetrator  must  be  capable  of  performing  at  impact  velocities  in 
excess  of  4000  fps.  Indeed,  impact  velocities  in  this  range  impose  severe  demands  on  the 
projectile  which  will  experience  significant  nose  erosion  [1,2].  This  has  prompted  the 
investigation  of  new,  high-strength  metal  alloys  for  the  penetrator.  However,  the  performance  of 
these  new  penetrator  materials  is  beyond  the  scope  of  this  report. 

There  currently  exists  a  substantial  database  for  high  velocity  penetration  of  small-scale 
projectiles  into  grout  and  concrete  targets.  The  most  comprehensive  of  these  studies  have  been 
conducted  at  Sandia  National  Laboratories  (SNL)  by  Forrestal  et  al  [3-7].  However,  the  entire 
SNL  database  is  for  monolithic  grout  or  concrete  targets.  This  has  raised  several  concerns 
among  investigators  with  respect  to  the  correlation  of  monolithic  penetration  data  with  layered 
target  penetration. 

Interest  in  the  penetration  of  layered  grout  or  concrete  targets  has  been  piqued  for  several 
reasons.  One  consideration  is  due  to  practical  aspects  associated  with  the  construction  of 
hardened  facilities,  specifically,  the  use  of  cold  joint  construction.  A  second  consideration  is 
more  strategic  in  nature,  that  is  the  case  of  multiple  burster  layer  slabs  employed  to  influence 
penetrator  trajectory.  A  third  consideration  is  an  idiosyncracy  of  full  scale  testing,  that  is,  the  use 
of  a  multiple  layer  target  configuration  to  represent  an  equivalent  monolithic  target.  For  these 
reasons,  the  present  study  was  initiated. 

Objectives  and  Scope 

The  primary  objective  of  this  study  is  to  conduct  scaled  high  velocity  penetration  experiments 
into  layered  grout  targets  and  compare  the  results  with  monolithic  target  data  in  an  attempt  to 
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quantify  penetration  enhancement.  A  second  objective  is  to  identify  characteristics  of  concrete 
penetration  peculiar  to  layered  target  systems. 

However,  prior  to  implementation  of  the  scaled  experimental  study,  a  test  plan  was  based  upon  a 
comprehensive  analytical  study  using  the  SAMPLL  [8]  computer  program.  The  results  of  the 
analytical  study  were  then  used  to  establish  the  layered  target  configurations  to  be  tested  and  the 
penetrator  impact  velocities. 

Analytical  Study 

The  analytical  study  was  conducted  to  establish  the  layered  target  configurations  to  be 
investigated  in  the  experimental  study  as  well  as  the  appropriate  impact  velocities  for  the  small 
scale  penetrators.  To  provide  a  meaningful  comparison  with  the  monolithic  data,  all  layered 
target  configurations  had  a  total  thickness  of  36  in,  since  the  monolithic  targets  were  30  in 
diameter  cylinders  of  36  in  length.  Furthermore,  the  layered  configurations  investigated  were 
selected  based  upon  the  inventory  of  grout  plate  targets  available.  All  plate  targets  were  32  in 
square,  cast  in  four  different  thicknesses.  The  inventory  of  available  grout  targets  is  presented  in 
Table  1.  The  physical  characteristics  of  the  small  scale  penetrator  are  illustrated  in  Figure  1  and 
summarized  in  Table  2. 


Table  1.  Inventory  of  Grout  Plate  Targets 


Number 

Thickness  (in) 

UNC  (psi) 

Density 

(Ib/cu.ft.) 

10 

4 

6409 

140.4 

10 

8 

8157 

139.8 

8 

12 

7856 

139.1 

5 

16 

7856 

139.1 

Table  2.  Small  Scale  Projectile  Data 


Type 

4340-1 

Weight 

0.15  lb 

Length 

3.5  in 

Diameter 

0.5  in 

CRH 

3.0 

Yield  Strength 

250  ksi 

Charpy  Impact 

18ft-lb 
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The  layered  targets  considered  in  the  analytical  study  are  summarized  in  Table  3.  For  each 
layered  configuration  (LC)  a  penetration  curve  was  developed.  Penetration  curves  for  six 
different  layered  configurations  and  the  corresponding  monolithic  target  is  presented  in  Figure  2. 
The  percent  increase  in  penetration  depth  exhibited  by  four  of  the  layered  target  configurations 
relative  to  the  monolithic  target  is  presented  in  Figure  3. 

Table  3.  Layered  Grout  Targets  Investigated  in  Analytical  Study 


Target  Configuration 

Layer  Thickness  (in) 

LCl 

4-16-16 

LC2 

12-12-12 

LC3 

4-8-8-16 

LC4 

4-4-8-8-12 

LC5 

8-16-12 

LC6 

8-12-16 

LC7 

8-8-8-12 

From  the  results  of  the  analytical  study  a  test  plan  was  devised.  It  was  decided  to  investigate  five 
different  layered  configurations  at  a  penetrator  velocity  of  1000  m/s,  and  four  of  these  same 
configurations  at  1200  m/s.  The  final  test  plan  is  summarized  in  Table  4.  Layered  target 
configuration  LC8  was  tested  at  1000  m/s  only  due  to  the  availability  of  4  in  plate  targets. 

Table  4.  Final  Test  Plan  for  Layered  Grout  Targets 


Target  Configuration 

Layer  Thicknesses  (in) 

Impact  Velocity  (m/s) 

LCl 

4-16-16 

1000  and  1200 

LC2 

12-12-12 

1000  and  1200 

LC3 

4-8-8-16 

1000  and  1200 

LC7 

8-8-8-12 

1000  and  1200 

LC8 

4.4-4-4.4-I6 

1000  only 
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Small  Scale  Experimental  Study 


The  small  scale  projectile  illustrated  in  Figure  1  was  fired  at  the  specified  velocities  into  the 
layered  grout  target  configurations  described  in  Table  4.  The  delivery  device  used  for  the 
experiments  was  the  50  mm  gun  located  on  Range  C-64  at  Eglin  Air  Force  Base,  FL.  A 
schematic  of  the  gun  and  the  high  velocity  concrete  penetration  experiments  (HVCP)  is 
presented  in  Figure  4.  Between  the  front  of  the  gun  and  the  plate  targets,  two  velocity  screens 
(V,  and  V2)  and  five  x-ray  cameras  (T,  through  T5)  were  situated.  The  pitch  of  the  projectile, 
before  impact,  was  ascertained  from  cameras  T,,  T2  and  Tj,  and  the  projectile  yaw  determined 
from  cameras  T4  and  T5.  Moreover,  the  velocity  determined  from  the  velocity  screens  was 
verified  from  cameras  Tj  and  Tj.  A  pre-test  photograph  of  target  configuration  LCl  is  presented 
in  Figure  5.  To  insure  the  best  possible  contact  between  the  individual  plates,  a  system  of  angle 
irons  and  threaded  rods  was  employed  at  both  the  top  and  bottom  of  the  target  configuration  as 
illustrated  in  the  photograph. 

In  an  attempt  to  quantify  the  penetration  enhancement  exhibited  by  the  layered  grout  targets, 
comparisons  were  made  with  the  average  penetration  experienced  by  several  monolithic  targets 
at  specified  velocities.  This  was  done  for  several  reasons:  1)  the  projectiles  used  for  the 
monolithic  target  tests  were  of  a  different  material  than  those  used  for  the  layered  target  tests; 
and  2)  the  unconfined  compressive  strength  (UNC)  of  the  monolithic  targets  was  the  same  as  that 
for  the  8  in  plate  targets  (8157  psi),  but  different  than  that  for  the  4  in.  12  in  and  16  in  plates 
(refer  to  Table  1). 

The  HVCP  depth  of  penetration  results  for  the  monolithic  targets  are  presented  in  Table  5  for 
1000  m/sec  and  in  Table  6  for  1200  m/sec.  A  post-test  photograph  of  monolithic  test  HVCP-41 
(AF-1410  projectile  at  1200  m/sec)  is  presented  in  Figure  6. 

Table  5.  Penetration  into  Monolithic  Targets  (1000  m/sec) 


Test 

Projectile 

Velocity 

(m/sec) 

Max.  Pitch 
(deg) 

Max  Yaw 
(deg) 

'  Penetration 
(in) 

HVCP-58 

ES-1 

1073 

6.5  up 

9.0 

12.0' 

HVCP-40 

AF-1410 

1000 

0 

2.40  r 

13.25 

12.63  (ave) 
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Table  6.  Penetration  into  Monolithic  Targets  (1200  m/sec) 


Test 

Projectile 

Velocity 

(m/sec) 

Max.  Pitch 
(deg) 

Max  Yaw 
(deg) 

Penetration 

(in) 

HVCP-53 

AerMet-100 

1220 

l.O  up 

3.0  r 

19.69 

HVCP-4I 

AF-1410 

1218 

1.70  dn 

2.44  r 

20.31 

20.0  (ave) 

The  HVCP  depth  of  penetration  results  for  the  layered  grout  targets  are  presented  in  Table  7  for 
1000  m/sec  and  in  Table  8  for  1200  m/sec.  Post-test  photographs  of  layered  target  test  HVCP-69 
are  presented  in  Figures  7  through  10.  Figure  7  shows  the  front  face  of  the  4  in  layer  and  Figure 
8  shows  the  back  face  of  the  same  layer.  Figures  9  and  10  show  the  front  face  and  back  face, 
respectively  of  the  first  1 6  in.  layer.  Post-test  photographs  of  the  front  face  and  back  face, 
respectively,  of  the  first  12  in  layer  from  test  HVCP-64  are  presented  in  Figures  1 1  and  12. 


Table  7.  Penetration  into  Layered  Targets  (4340-1  @  1000  m/sec) 


Test 

Target 

Velocity 

(m/sec) 

Max  Pitch 
(deg) 

Max  Yaw 
(deg) 

Layers 

Penetrated 

Penetration 

(in) 

%  Increase  in 
Penetration 

HVCP-64 

LC2 

982 

4.5  dn 

2  1 

2 

13.35 

5.7 

HVCP-65 

LCl 

1107 

5.5  up 

6  r 

2 

12.725 

0.7 

HVCP-66 

LC3 

1070 

8.5  dn 

7.5  r 

J) 

15.31 

21.2 

HVCP-67 

LC7 

947 

0 

1.0  r 

2 

13.0 

2.9 

HVCP-72 

LC8 

970 

17  dn 

3.5  1 

4 

13.125 

3.9 

6.9  (ave) 

HVCP-68 

LC2 

1227 

1.5  dn 

4.0  1 

2 

20.125 

0.6 

HVCP-69 

LCl 

1278 

3.5  dn 

0 

-> 

J) 

22.345 

1 

1 1.9 

HVCP-70 

LC3 

1236 

1.5  dn 

2.0  1 

4 

21.188 

5.9 

HVCP-71 

LC7 

1240 

0 

7.0  1 

20.0 

9.4 

6.9  (ave) 

33-7 


Post-Test  Observations 


The  results  of  the  HVCP  tests  provided  a  number  of  interesting  observations  with  respect  to 
penetration,  projectile  tunneling  through  layers,  front  face  and  back  face  cratering  in  the 
individual  layers,  and  the  development  of  a  compaction  zone  around  the  projectile  tunnel.  This 
is  especially  evident  when  the  HVCP  test  results  for  the  layered  targets  are  compared  with  the 
HVCP  monolithic  test  results. 

With  respect  to  penetration,  the  layered  target  configurations  exhibited  some  enhancement,  but 
not  nearly  as  much  as  predicted  by  the  analytical  study.  For  example,  for  LC3,  the  predicted 
increase  in  penetration  was  almost  50%  for  1000  m/sec,  as  illustrated  in  Figure  13.  However,  the 
actual  penetration  enhancement  was  21.2%  exhibited  in  test  HVCP-66  as  presented  in  Table  6. 
For  this  same  configuration  at  1200  m/sec,  the  predicted  increase  in  penetration  was 
approximately  45%  as  shown  in  Figure  14.  The  observed  increase  in  penetration  for  layered  test 
HVCP-70,  however,  was  only  5.9%.  In  fact,  the  average  increase  in  penetration  for  all  layered 
configurations  tested  was  approximately  7%,  for  both  the  1000  m/sec  and  1200  m/sec  HVCP 
tests. 

The  tunneling  phenomenon  was  readily  observed  through  the  individual  layers.  Well  defined 
tunnels  of  approximately  V2  in  diameter  were  clearly  manifested  throughout  all  layers  penetrated. 
In  direct  contrast  to  monolithic  targets,  this  same  observation  cannot  be  made  without  conducting 
laborious  coring  and  slicing  operations. 

The  cratering  aspects  of  the  penetration  phenomenology  presented  some  interesting  observations 
in  the  layered  target  configurations.  The  cratering  observed  in  the  front  faces  of  the  first  layer  of 
the  layered  target  configurations  was  generally  consistent  with  the  cratering  observed  in  the  faces 
of  the  monolithic  targets,  both  in  terms  of  crater  depth  and  diameter.  One  distinct  dissimilarity, 
however,  was  that  the  monolithic  target  faces  generally  exhibited  a  greater  number  of  radial 
cracks  than  the  first  layer  front  faces  of  the  layered  target  configurations.  This  observation  is 
clearly  evidenced  in  Figures  6  and  7. 

Cratering  on  the  target  faces  at  layer  interfaces  exhibited  two  distinct,  and  quite  different, 
appearances.  In  some  instances,  there  was  virtually  no  evidence  of  a  crater  forming.  The  tunnel 
was  surrounded  by  a  circular  compaction  zone  having  a  radius  of  approximately  2  to  3  penetrator 
diameters.  Where  radial  cracks  developed,  they  were  initiated  outside  the  circumference  of  the 
compaction  zone.  This  observation  is  clearly  illustrated  in  Figures  8  and  9.  Howex  er,  in  direct 
contrast  to  this  behavior,  in  some  instances  cratering  quite  similar  to  that  observed  in  the  front 
face  of  the  first  layer  in  a  layered  target  configuration,  or  on  the  face  of  a  monolithic  target, 
developed  in  the  interior  layer  interfaces.  This  is  clearly  demonstrated  in  Figure  10. 

At  this  early  stage  in  the  investigation  of  high  velocity  penetration  into  layered  grout  targets,  it  is 
quite  difficult  to  ascertain  the  precise  reason  for  these  diverse  observations  at  the  layer  interfaces. 
One  possible  explanation  is  the  degree  or  completeness  of  contact  between  the  individual  layers. 
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Because  many  of  the  plate  target  faces  were  not  level  or  bowed  (either  concave  or  convex),  gaps 
were  formed  in  many  of  the  interfaces  in  the  various  layered  target  configurations  tested.  These 
gaps  may  have  contributed  to  the  cratering  observed  at  the  layer  interfaces.  However,  at 
interfaces  where  good  contact  between  layers  was  maintained,  confinement  was  preserved  and 
crater  formation  did  not  occur. 

Conclusions  and  Recommendations 


The  results  of  the  experimental  study  have  uncovered  several  interesting  aspects  regarding  the 
high  velocity  penetration  into  layered  concrete  or  grout  targets.  There  does  appear  to  be  some 
penetration  enhancement  in  layered  targets  in  comparison  to  corresponding  monolitliic  targets. 
However,  the  increase  in  penetration  does  not  appear  to  be  as  great  as  predicted  by  the 
preliminary  analytical  study.  Moreover,  there  are  several  factors  influencing  penetration  in 
layered  targets  which  are  not  a  consideration  in  penetration  of  monolithic  targets.  These  factors 
include  the  number  of  layers  penetrated,  the  relative  thickness  of  the  individual  layers  and  the 
contact  condition  at  layer  interfaces. 

The  results  of  the  experimental  study  generally  indicate  that  penetration  depth  increases  as  the 
number  of  layers  penetrated  increases.  The  exception  to  this  trend  is  HVCP-72  in  which  the 
penetration  enhancement  percentage  is  only  3.9%,  well  below  the  average  for  all  targets  ot  6.9%. 
However,  this  particular  test  exhibited  an  extremely  large  amount  of  pitch  (17  degrees)  which 
undoubtedly  reduced  the  potential  penetration  significantly. 

Probably,  the  single  most  important  factor  influencing  penetration  of  layered  targets  is  the 
contact  condition  at  layer  interfaces.  As  discussed  in  the  previous  section,  this  condition  aflects 
the  cratering  phenomenon  at  layer  interfaces.  Interfaces  experiencing  good  contact  conditions 
and  confinement  exhibit  less  cratering  around  the  tunnel  than  do  interfaces  with  gaps.  However, 
it  cannot  be  ascertained  whether  these  interface  gaps  enhance  or  mitigate  penetration. 

To  address  these  uncertainties  concerning  penetration  into  layered  target  systems,  it  is 
recommended  that  a  second  phase  of  small  scale  experiments  be  conducted.  In  the  Phase  II 
experimental  study  it  is  recommended  that  penetration  curves  be  developed  tor  a  limited  number 
of  layered  target  configurations.  It  is  suggested  that  penetrator  impact  velocities  range  from 
1000  m/s  to  1600  m/s.  Moreover,  it  is  further  recommended  that  two  separate  series  of 
experiments  be  conducted  for  each  target  configuration.  The  first  series  should  be  conducted 
with  full  contact  between  all  layer  interfaces,  and  the  second  series  be  conducted  on  target 
systems  having  specified  gaps  between  layers.  These  series  experiments  should  lend  some 
insight  into  quantifying  the  effect  of  the  contact  condition  at  layer  interfaces  on  penetration. 
Some  possible  layered  target  configurations  that  might  be  investigated  in  the  Phase  11 
experiments  are  summarized  in  Table  8. 


33-9 


Table  8.  Layered  Target  Conflgurations  for  Phase  II  Experiments 


No.  of  Layers 

Layer  Thicknesses 
(in) 

3 

12-12-12 

4 

8-8-8-12 

7 

4.4.4.4.4.4.12 

Finally,  to  further  to  investigate  and  understand  the  penetration  phenomenon  into  layered  target 
systems,  it  is  recommended  that  a  comprehensive  parametric  study  be  conducted  using  the 
Lagrangian  finite  element  hydrocode  EPIC  [9],  The  smooth  particle  hydrodynamics  (SPH) 
algorithms  in  EPIC  make  it  possible  to  analyze  both  severe  distortions  and  structural  responses 
with  a  single  Lagrangian  code. 
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Calculation 
history  points 


Figure  1.  Small  scale  projectile 


Figure  2.  Penetration  curves  for  layered  grout  targets. 
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Figure  3.  Increase  in  penetration  depth  in  layered  targets. 
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Figure  4.  Schematic  of  50  mm  HYCP  set-up 


SIDE  VIEW 
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Figure  7.  HVCP-69  If  (4340-1  0  1200  m/s  into  4/16/16). 
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Figure  8.  HVCP-69  IB. 
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Figure  9.  HVCP  -  69  2F, 
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ENHANCEMENTS  TO  A  DIRECT  AEROELASTIC  STABILITY  COMPUTATIONAL  MODEL 


J.  Mitch  Wolff 
Assistant  Professor 

Department  of  Mechanical  and  Materials  Engineering 
Wright  State  University 


Abstract 

The  ability  to  accurately  and  efficiently  predict  transonic  flutter  boundaries  is  investigated 
using  an  enhanced  direct  computational  method.  Steady  characteristic  and  unsteady  approximate 
non-refiecting  characteristic  farfield  boundary  conditions  are  utilized  to  more  accurately  model  the 
aerodynamic  flow  physics  in  a  direct  method.  In  addition,  the  aerodynamic  model  is  modified  to 
lock  the  movement  of  the  farfield  grid  points  while  allowing  the  airfoil  surface  points  to  move  freely. 
This  is  accomplished  by  introducing  a  linear  weighting  function  in  the  grid  deformation  model. 
The  direct  method  is  based  on  a  discretization  of  the  Euler  equations  and  a  coupled  set  of 
structural  dynamics  equations  representative  of  a  pitch-and-plunge  airfoil  with  trailing  edge  flap. 
The  coupled  equations  are  expanded  to  specify  a  Hopf-bifurcation  point,  which  defines  an 
incipient  flutter  state.  Since  the  enhanced  farfield  boundary  conditions  improve  the  modeling  of 
the  flow  physics,  the  computational  domains  can  be  reduced  in  size  over  similar  models  with 
simple  freestream  conditions.  A  flapped  NACA  64A006  airfoil,  executing  pitching  and  plunging 
motion,  is  utilized  to  demonstrate  the  ability  of  the  enhanced  direct  method  to  accurately  and 
efficiently  calculate  flutter  boundaries  for  reduced  domain  sizes.  Both  zero  and  non-zero  angle  of 
attack  results  are  shown  to  highlight  the  improved  accuracy  and  efficiency  of  the  enhanced  inodel. 
Each  model  modification  resulted  in  analysis  improvements,  with  the  steady  characteristic  model 
demonstrating  significant  improvements  in  the  nonlinear  flow  regime.  Fqr  a  1°  static  pre-twist 
analysis  at  a  transonic  freestream  Mach  number  of  0.84,  the  enhanced  model  resulted  in  over  a 
75%  decrease  in  the  flutter  speed  error.  Therefore,  the  capability  to  more  accurately  and  efficiently 
model  transonic  flow  conditions  with  strong  shock  interaction  has  been  shown. 
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ENHANCEMENTS  TO  A  DIRECT  AEROELASTIC  STABILITY  COMPUTATIONAL  MODEL 


Tim  J.  Leger 
J.  Mitch  Wolff 


Introduction 

In  the  design  of  flexible  aircraft  structures,  it  is  important  to  determine  accurately  and 
efficiently  the  flutter  boundaries  within  which  aircraft  can  safely  operate.  Using  conventional 
computational  fluid  dynamics  (CFD)  methods,  these  boundaries  are  estimated  through  time 
integration  of  the  governing  dynamical  (fluid  and  structure)  equations.  In  a  design  environment, 
where  a  large  number  of  configurations  are  examined,  time  integration  with  CFD  techniques  can 
be  unsystematic  and  prohibitively  inefficient.  However,  efficient  aerodynamic  methods  currently 
used  within  aeroelastic  design  procedures  are  typically  based  on  linear  aerodynamics  and,  thus, 
are  not  well  suited  to  transonic  flutter  computations.  There  is  a  need  for  a  robust  method  of  flutter 
prediction  of  sufficiently  high  fidelity  to  capture  properly  the  nonlinear  aerodynamics  characteristic 
of  transonic  flight,  but  with  sufficient  efficiency  to  bring  nonlinear  aerodynamics  to  the  design 
environment. 

Recently,  direct,  or  non-time-integration,  methods  have  been  developed  that  are  suitable 
for  both  flutter  prediction  and  steady  aeroelastic  analysis.  Morton  and  Beran  (1995,1996A) 
developed,  implemented  and  validated  the  first  direct  method  for  the  calculation  of  flutter  points  in 
2-D  transonic  flow.  The  implicit  methodology  provides  the  ability  to  compute  the  critical  speed,  as 
an  unknown  of  an  expanded  system  of  equations,  at  which  the  equilibrium  fluid-structure  system 
loses  stability  to  periodic  motions.  For  a  soft  flutter  condition,  the  critical  speed  is  the  speed  at 
which  limit-cycle  oscillation  (LCO)  is  first  possible,  and  becomes  crrtical  at  a  supercritical  Hopf- 
bifurcation  point. 

There  are  advantages  and  disadvantages  of  the  use  of  direct  methods  for  flutter  prediction 
relative  to  that  of  time-integration  methods.  The  advantages  are  primarily  three-fold:  (1)  direct 
methods  avoid  the  need  for  bracketing,  i.e.,  the  repetitious  application  of  a  time-integration 
method  to  obtain  stable  and  unstable  equilibrium  solutions  about  a  presumed  critical  state,  (2)  the 
computational  effort  of  direct  methods  is  not  time-step  sensitive,  whereas  the  computational  effort 
of  time  integration  is  inversely  proportional  to  the  time  step,  (3)  direct  methods  obtain  an  explicit 
representation  of  the  aeroelastic  mode  that  is  associated  with  the  loss  of  stability  in  the  system.  The 
primary  disadvantage  of  direct  methods  to  date  has  been  the  fully  implicit  and  computationally 
expensive  solution  strategies  employed  to  compute  simultaneously  the  aeroelastic  solution,  critical 
aeroelastic  mode,  flutter  speed,  and  flutter  frequency.  Still,  for  grids  typical  of  inviscid,  2-D, 
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transonic  airfoil  calculations  (2000-3000  grid  points),  Beran  (1998)  observed  significant 
improvements  (a  factor  of  50)  in  computational  performance  of  a  direct  approach  over  an  explicit, 
total-variation  diminishing,  time-integration  scheme. 

Direct  methods  based  on  bifurcation  analysis  have  been  improved  and  applied  in  several 
recent  studies.  Beran  and  Morton  (1997)  added  a  solution-path  tracing  capability  to  the  direct 
solver  called  BIFAE  to  allow  flutter  boundaries  to  be  automatically  computed,  and  made  numerous 
fundamental  improvements  to  their  algorithmic  procedure.  Morton  and  Beran  (1996B)  studied  the 
flutter  characteristics  of  an  airfoil  with  bi-linear  structural  coupling,  including  freeplay,  and  later 
(1997)  implemented  a  deforming  mesh  strategy  and  investigated  the  effect  of  a  trailing-edge  flap 
on  airfoil  flutter  speed.  Beran  (1998)  developed  a  new  direct  methodology  within  BIFAE  that 
allowed  robust  computation  of  flutter  states  throughout  the  transonic  regime  and  at  angle-of-attack, 
and  adopted  a  block-relaxation  strategy  that  reduced  computational  work  and  storage 
requirements,  especially  for  large  grids.  Most  recently,  Lee.  Jiang,  and  Wong  (1998)  applied  a 
similar  bifurcation  analysis  to  a  pitch-and-plunge  airfoil  with  a  cubic  nonlinear  restoring  force.  In 
this  work,  the  aerodynamics  was  modeled  (i.e.,  not  computed  with  CFD),  allowing  a  system  of 
reduced  order  to  be  studied.  They  also  applied  asymptotic  theory  to  the  computation  of  LCO 
amplitude  for  the  supercritical,  hard-spring  case. 

Beran  and  Morton  (1997)  noted  flutter-speed  sensitivity  to  domain  size  in  the  transonic 
regime,  which  resulted  from  the  following;  growth  of  the  lateral  length  scale  in  the  transonic 
regime;  the  relatively  small  domains  used  in  their  analysis  (6  to  12  airfoil  chords),  and  the 
enforcement  of  far-field  conditions  based  on  freestream  conditions  and  extrapolation.  To  minimize 
the  sensitivity  in  the  transonic  regime,  a  corrective  action  is  required.  Grid  insensitivity  could  be 
achieved  through  domain  enlargement.  However,  owing  to  the  current  use  of  a  fully  implicit 
solution  methodology,  direct  flutter  analysis  with  computational  grids  of  increased  size  incurs 
significant  performance  penalties  (these  penalties  are  reduced  with  a  domain  decomposition 
procedure,  but  not  eliminated).  A  more  reasonable  approach  to  reducing  grid  sensitivity  is  to 
improve  the  analytical  formulation  of  the  farfield  boundary  conditions. 

One  of  the  most  important  factors  determining  the  success  or  failure  of  a  CFD  code  is  the 
boundary  conditions.  The  far  field  boundaries  are  usually  set  to  freestream  values.  Ajthough 
appropriate,  these  boundary  conditions  require  domain  enlargement  as  previously  discussed. 
Theoretically,  a  more  appropriate  boundary  condition  for  the  far  field  boundary  is  the  characteristic 
variable  method  (Janus  (1984)),  which  is  consistent  with  the  concept  of  upwinding.  Therefore, 
characteristic  variable  boundary  conditions  are  implemented  giving  a  more  efficient  smaller 
computational  domain.  In  addition,  moving  airfoils  are  known  to  generate  unsteady  disturbances, 
which  propagate  from  the  airfoil.  These  propagating  waves  must  not  be  reflected  back  rto  the 
solution  domain  by  the  farfield  boundary  conditions.  This  problem  is  avoided  by  utilizing 
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approximate  non-reflecting  boundary  conditions  for  the  unsteady  aerodynamic  analysis  (Giles 
(1990)). 

The  objective  of  this  research  is  to  implement  and  verify  improvements  obtained  by 
utilizing  higher  fidelity  farfield  boundary  conditions.  Characteristic  variable  and  approximate  non¬ 
reflecting  farfield  boundary  conditions  for  steady  and  unsteady  flows,  respectively,  will  be 
implemented  in  the  direct  BIFAE  aeroelastic  model.  A  NACA  64A006  flapped  airfoil  executing 
pitching  and  plunging  motion  will  be  utilized  for  both  zero  and  non-zero  static  pretwist 
configurations  to  demonstrate  the  accuracy  and  efficiency  improvements.  In  particular,  a  nonlinear 
transonic  flow  configuration  will  be  investigated  to  highlight  improvements  in  the  transonic  analysis 
capability  of  the  enhanced  model. 

Methodology 

In  this  section,  the  mathematical  model  for  a  pitch-and-plunge  airfoil  in  inviscid  flow  is 
briefly  summarized,  and  the  analytical  conditions  satisfied  at  flutter  points  provided.  Furfoer  details 
are  given  in  the  references  cited  above. 

The  rigid,  pitch-and-plunge  airfoil  with  flap  is  constrained  to  plunge  along  a  vertical  line 
and  is  allowed  to  pitch  about  an  elastic  axis.  See  Figure  1.  Linear  and  torsional  springs,  with 
specified  stiffness  and  damping  constants  (K„.  K*.  Kp.  D„,  D/,,  Dp),  mediate  the  motion  of  the  airfoil 
according  to  three  linear,  non-homogeneous,  second-order  ordinary  differential  equations  for  pitch 
(represented  by  the  angle  of  attack,  a),  plunge  (represented  by  the  vertical  displacement,  h),  and 
flap  deflection,  p.  These  equations  are  expressed  as  six,  first-order  differential  equations,  and  are  a 

function  of  the  reduced  velocity.  “=v-/(Arai)  where  v.  is  the  freestream  velocity,  b  is  the  semi¬ 
chord.  and  <a„  is  the  pitch  natural  frequency. 


The  governing  aerodynamic  equations  are  the  Euler  equations,  cast  in  strong-conservative 
form  for  a  general  curvilinear  coordinate  system.  These  equations  are  placed  in  discrete  form  using 
a  moving  grid-point  form  of  a  standard  upwind  total-variation  diminishing  (TVD)  scheme  (Morton 
and  Beran  (1996A)).  The  systems  of  discretized  fluid-dynamic  equations  and  structural  equations 
are  combined  into  a  single  time-dependent  system. 
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n  =  GO';«)  (1) 

where  Y  is  an  array  containing  all  the  aerodynamic  and  structural  dynamic  variables,  and  G  is  an 
array  of  all  the  model  equations.  Equilibrium  solutions  of  Equation  1  satisfy  the  nonlinear  system 
G(y;«)=0,  and  are  computed  with  Newton’s  method.  In  this  computation,  the  Jacobian  Gy  is 

calculated  numerically  to  second-order  accuracy  and  fully  stored.  The  structure  of  Gy  is  that  of  a 
bordered,  banded  matrix. 

The  aeroelastic  system  governed  by  Equation  1  is  also  simulated  through  time  integration 
following  the  approach  detailed  by  Beran  and  Morton  (1997).  Using  this  approach,  validation  of 
stable  equilibrium  solutions  obtained  with  Newton’s  method  can  be  obtained,  as  well  as  validation 
of  computed  flutter  points. 

The  equations  representative  of  a  Hopf-bifurcation,  or  flutter,  point  are  developed  by 
applying  an  assumed  form  of  the  solution  V.  At  a  Hopf  point,  a  steady-state  solution  transitions  to  an 
oscillatory  solution  with  zero  amplitude.  The  Hopf-point  solution  is  assumed  to  be  of  the  form, 
Y(f,u)=Y„(fiy£Pe^’‘ .  where  e  is  some  small  parameter  and  P  is  a  coefficient  vector.  Substitution  of  this 
assumed  form  into  Equation  1  yields  the  eigen-problem 

G_YP=pP  (2) 

The  Hopf-point  condition  is  satisfied  when  the  real  part  of  p  vanishes  for  some  complex  pair  of 
eigenvalues: 


^=ie  (3) 

where  I  is  the  imaginary  number  and  0  is  the  flutter  frequency.  With  Eqn.  2  and  Eqn.  3,  the  discrete 
equations  G  =  0  are  expanded  to  yield  a  system  of  equations,  approximately  three  times  the  size  of 
the  original  disaete  system,  for  the  critical  aeroelastic  state,  Y*,  the  critical  aeroelastic  mode,  P*, 
the  flutter  velocity,  u*,  and  the  flutter  frequency.  0*: 

G 

{G,-IQ)P 
q^P-l 

with  q  defined  as  a  user  specified  constant  array.  The  solution  of  this  expanded  system  within 
BIFAE  is  described  in  detail  by  Beran  (1998). 


Boundary  Conditions 


Theoretically  speaking  the  characteristic  variable  boundary  conditions  should  more 
accurately  model  nature  due  to  the  ability  to  pass  information  into  and  out  of  the  computational 
domain.  In  order  to  correctly  implement  these  characteristic  variable  boundary  conditions  however, 
the  outer  boundary  of  the  computational  domain  must  remain  fixed  in  space.  To  accomplish  this 
while  allowing  the  airfoil  to  move,  a  weighting  function  grid  deformation  technique  is  employed. 
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Grid  Deformation 

In  the  deforming  grid  technique,  the  outer  boundary  remains  fixed  in  space,  with  a 
deforming  grid  method  used  to  locate  the  position  of  the  airfoil.  The  grid  lines  connectng  the 
inner  and  outer  boundaries  deform  with  the  airfoil  motion.  Figure  2  shows  a  reduced  computational 
grid  with  an  exaggerated  airfoil  mccon,  which  highlights  the  deforming  grid  method.  The  amount 
of  deformation  is  a  function  of  the  distance  from  the  airfoil  surfaces.  The  grid  deformation  is 

defined  as 

Ax,,  =  iV,;(Ax,„)  (5a) 

Ay,,  =  w(Ax.,)  (5*^ 


where  Ax,j  and  Ayij  are  the  spatial  differences  that  would  exist  between  successive  time  stecs  if  the 
entire  grid  were  to  move  as  a  rigid  body.  The  weighting  function,  W,  is  defined  as 


(6) 


where  s  is  the  arc  length  of  a  grid  line  from  the  airfoil  surface,  ti  =  1,  to  some  grid  point  aong  q  = 
constant,  and  rimax  is  the  outer  boundary  grid  line  as  shown  in  Figure  2. 


From  Equations  5  and  6,  tfie  nodes  at  the  inner  boundary,  s  =  0,  give  =  1,  which  means 
the  airfoil  surface  is  deformed  completely,  as  a  rigid  body  motion.  Conversely,  the  outer  boundary 
nodes  give  W,.,  =  0,  which  signifies  that  these  nodes  remain  fixed  in  space  at  their  initial  specified 
locations.  The  interior  nodes  shear  in  space  relative  to  the  initial  grid  as  Wi.  varies  from  0  cc  1.  The 
grid  speeds  are  then  calculated  by  dividing  the  grid  deformation  by  the  time  step  used. 
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steady  State  -  Janus  Characteristic  Variable  Method 

The  steady  state  characteristic  variable  boundary  conditions  are  consistent  with  the 
concept  of  upwinding  in  which  the  signs  of  the  characteristic  velocities  determine  the  appropriate 
propagation  directions  (Janus,  1984).  A  summary  of  their  derivation  is  given  in  the  following.  The 
Euler  equations  written  in  their  non-conservative  form  are 

<h+aq^+bq^=(i  (7) 

with  the  matrices  a  and  b  determined  through  an  eigenvalue  analysis. 

The  far  field  boundary  conditions  are  obtained  by  multiplying  Equation  7  by  and 
neglecting  the  derivatives  in  the  normal  ^  direction  to  give 

ft  +  Pr)  (®) 

where  Ai,  is  a  diagonal  matrix  containing  the  eigenvalues,  ^  ,  and  p,,  and  p^*  are  the  left  and  right 

eigenvectors,  respectively. 

The  characteristic  vector  is  defined  as 

=  (9) 

Ptj  is  such  that  the  elements  of  the  characteristic  vector  become 


— +(r,.«  +  rj,v) 


{10a) 

(10b) 

(10c) 

(lOd) 


The  corresponding  eigenvalues  are 

K  =('».“ +  >7,  + 

K  =('7.«  +  '7,v)-c|V7jj 


(11a,b) 

(110) 

(lid) 


The  implementation  of  the  steady  state  far  field  characteristic  variable  boundary  conditions 
is  accomplished  by  computing  the  sign  of  X  to  determine  the  directions  of  the  characteristics.  The 
subscript  “a"  denotes  approaching  the  boundary,  “b”  refers  to  on  the  boundary,  and  “I”  leaving  the 
boundary.  Then  the  characteristics  approaching  the  boundary  are  set  equal  to  the  characteristics 
on  the  boundary  using  Equation  10. 
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)(tj.  u  +  n,v) 


+  sigt^X\  ){n,u  +  Tl,v) 

LP><=1  i 


+  sign[x'){t},u  +  l^  1 


+  sign[X\)['n,u  +  Ji,v) 


(12a) 

(12b) 

(12c) 

(12d) 


The  metrics  at  points  “a”  and  “I*  are  assumed  to  be  the  same  as  those  at  point  “b”.  Combining 
Equations  12{c,d)  gives  Equation  13a,  with  the  remaining  flow  variables  at  the  far  field  boundary 
found  by  simultaneous  solution  of  Equations  12(a-d). 


+  P,  -AC, ['?.(“.  -“,)+»?, (v,  -V,)]) 

(13a) 

p.-p. 

(13b) 

„  Pa-Pi 

(13c) 

piq 

Piq 

(13d) 

Unsteady  Flows  -  Giles  Approximate  Non-Reflectino  Characteristic  Method 

Approximate  non-reflecting  unsteady  farfield  boundary  conditions  are  developed  by  assuming 
that  linear  theory  can  be  applied.  Giles  (1990)  derived  “non-reflecting”  boundary  conditions  for  a 
general  turbomachinery  Euler  solver.  First  the  steady  flow  is  solved  using  the  previously  presented 
characteristic  variable  boundary  conditions.  The  linearized  Euler  equations  are  then  solved  at  the 
farfield  boundary  to  determine  the  perturbation  flow  variables  in  terms  of  the  characteristic 
variables.  This  allows  time  variations  of  static  pressure  at  the  farfield  and  reduces  reflections  from 
the  boundaries.  A  brief  outline  of  this  method  is  given  below. 

The  linearized,  two-dimensional  Euler  equations  are  written  in  terms  of  primitive  small- 
perturbation  variables  as 

Q.+Aa.+BQ,  =  0  (14) 


where 


g=(Sp.SuM^y 


U 

P 

0 

O' 

v 

0 

P 

o' 

0 

u 

0 

1 

0 

V 

0 

0 

P 

.  B  = 

0 

0 

0 

0 

u 

0 

V 

p 

0 

yp 

0 

0 

0 

yp 

V 

The  elements  of  the  vector  Q’  represent  perturbations  from  uniform  flow  conditions 


(15) 

(17) 
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p  =  p.+^  (18a) 

u  =  u,+Su  ^  (jgb) 

v  =  v.+&;  (18c) 

P=Po+SP  (18d) 

where  pg.  Ug,  Vg,  and  Pg  specify  the  steady  solution. 

The  matrices  A  and  B  are  evaluated  using  these  same  conditions.  The  analysis  is  greatly 
simplified  if  the  unsteady  perturbations  and  the  steady  variables  in  A  and  B  are 
nondimensionalized  using  the  steady  density  and  speed  of  sound.  With  this  choice  of 
nondimensionalization,  the  final  forms  of  the  matrices  A  and  B  are 


1 

0 

0  ■ 

0 

1 

0  ■ 

0 

0 

1 

,  5  = 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

1 

where  Mx  and  My  are  the  Mach  numbers  in  the  x  and  y  directions. 

A  set  of  eigenvectors  representing  an  entropy  wave,  a  vorticity  wave  and  an 

upstream/downstream  running  pressure  waves  are  determined  by  Fourier  analysis  of  these 

equations.  Using  these  eigenvectors  and  assuming  locally  one-dimensional  flow  at  the  boundary, 

the  characteristic  variables  for  unsteady  flows  are  written  in  terms  of  the  perturbation  variables  as 

■c,l  r-i  0  0  oir^l  (20) 

c,  _  0  0  1  1  & 
c,  "  0  1  0  1  &- 

cj  [o  -1  0  oJ[5P 

For  a  subsonic  freestream  flow,  the  amplitudes  of  the  incoming  unsteady  characteristics  (Cl,  C2, 
C3)  are  set  to  zero  and  the  outgoing  characteristic  (C4)  is  computed  using  Equation  20.  For 
subsonic  downstream  flow,  C4=0  and  the  remaining  characteristics  are  computed.  Once  the 
characteristics  are  known,  the  perturbation  variables  are  found  using  an  inverse  transform 

M  h  ”  j  jjp'i 

^  _  0  0  - 

0  1  0  0^ 

0  0  -  -  L^*. 

L  2  2  J 

I 

The  primitive  flow  variables  are  found  using  Equation  18.  With  the  non-conservative  flow 
variables  found,  the  conservative  dependent  variables  are  easily  computed  for  the  farfield 
boundaries. 


Results 

The  direct  computational  procedure  BIFAE  has  been  successfully  validated  against  other 
inviscid  methods,  including  Euler  and  transonic  small-disturbance  methods  (Morton  and  Beran 
(1997)  and  Beran  and  Morton  (1997)).  Validation  of  BIFAE  was  also  the  subject  of  work  by  Buxton 
and  Beran  (1997).  In  this  report,  both  equilibrium  and  flutter  point  solutions  are  obtained  for  these 
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same  configurations  to  validate  the  new  farfield  boundary  condition  modifications.  The  sensitivity 
of  these  new  boundary  conditions  to  domain  size  is  examined  through  the  use  of  flutter  boundaries 
and  Cp  distributions.  Results  were  obtained  using  a  standardized  set  of  structural  and  aerodynamic 
parameters  for  a  baseline  configuration  of  the  BIFAE  algorithm. 

NACA  64A006  Flapped  Airfoil  Pitching  and  Plunging 

The  enhanced  Hopf-bifurcation  analysis  is  applied  to  a  NACA  64A006  flapped  airfoil  with 
pitching  and  plunging  motion  for  various  computational  grids  and  aerodynamic  parameters.  The 
structural  parameters  were  not  varied  in  this  research,  because  the  improvements  to  the  analysis 
were  related  to  the  aerodynamic  model.  Therefore,  only  variations  in  the  aerodynamic  parameters 
were  required  to  demonstrate  the  successful  implementation  of  the  improved  flow  physics  model. 

The  structural  parameters  utilized  for  all  the  computational  results  are  presented  in  Table  1. 


Symbol 

Description 

Value 

center  of  gravity  (from  LE) 

0.5 

airfoil  damping  ratios 

0.0 

flap  damping  ratio 

0.01 

(Oh/0)„ 

natural  frequency  ratio 

0.3 

(Op/cOa 

flap  natural  frequency  ratio 

1.5 

Xa 

elastic-axis  location 

-0.2 

11 

airfoil-fluid  mass  ratio 

50 

Table  1;  NACA  64A006  Structural  Parameter  Values 
To  demonstrate  the  increased  solution  accuracy  obtained  by  modifying  the  farfield 
boundary  conditions,  three  different  computational  domains  were  utilized.  Table  2  gives  specific 
details  about  each  computational  grid  used  to  model  the  NACA  64A006  airfoil. 


Grid 

A|o 

Rrrax 

1 

J 

G-1 

0.0010 

0.0005 

0.0006 

5 

84 

31 

G-3 

0.0010 

0.0005 

0.0006 

50 

84 

43 

G-4 

0.0010 

0.0005 

0.0006 

150 

84 

49 

Table  2;  NACA  64A006  Computational  Grids 

As  shown  in  Table  2,  the  three  computational  grids  used  are  the  same  grid  except  for  the  location 
of  the  farfield  boundary.  The  G-1  grid  has  a  domain  size  of  5  airfoil  chords  with  the  G-3  and  G-4 
grids  having  50  and  150  chords  domain  sizes,  respectively.  Utilizing  the  same  grids  except  for  in 
the  J  direction,  ensures  that  any  difference  in  the  predictions  is  caused  by  modifications  to  the 
farfield  boundary  condition  or  boundary  location. 

Since  the  direct  hopf-bifurcation  model  is  optimized  for  calculating  flutter  boundaries,  then 
flutter  boundary  studies  will  be  utilized  to  demonstrate  the  enhancements  to  the  model  accuracy 
obtained  with  the  new  farfield  boundary  condtions.  These  results  are  typically  obtained  by 
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establishing  all  the  aerodynamic  parameters,  then  incrementally  modifying  only  one  parameter. 
Traditionally,  the  parameter  which  is  incremented  is  the  freestream  Mach  number.  The  initial 
investigation  of  the  enhanced  analysis  was  accomplished  by  varying  the  freestream  Mach  number 
from  0.7  to  0.9  with  a  static  pre-twist  angle  of  attack  of  0°.  The  continuation  feature  designed  into 
the  analysis  was  utilized  for  these  investigations.  After  the  flutter  point  was  calculated  for  the  0.7 
Mach  case,  the  Mach  number  was  incremented  by  0.01  with  the  previous  flutter  point  solution  used 
to  begin  the  solution.  The  analysis  was  continued  until  the  solution  did  not  converge  on  a  flutter 
point  for  a  particular  Mach  number. 

Initially,  obtaining  a  flutter  boundary  that  is  independent  of  the  computational  domain  size 
was  accomplished.  Figures  3  &  4  shows  the  flutter  boundary  analysis  of  the  flutter  speed  and 
reduced  frequency  variation  with  Mach  number  for  the  84x43  and  84x49  grids,  which  have  domain 
sizes  of  50  and  150  chords,  respectively.  The  analysis  did  not  converge  above  a  Mach  number  of 
0.88.  The  flutter  boundaries  for  both  domains  are  exactly  the  same.  Therefore,  the  baseline  grid 
independent  solution  for  the  specified  structural  and  aerodynamic  parameters  is  known.  The  flutter 
bucket  for  this  configuration  occurs  between  0.83  and  0.85  Mach  number.  The  flutter  bucket 
location  is  of  primary  interest  when  flutter  boundaries  are  analyzed. 


0.7  0.72  0.74  0.76  0.78  0.8  0.82  0.84  0.86  0.88 

Freestream  Mach  Number 


Figure  3:  Reduced  Velocity  Flutter  Boundary  for  a  NACA  64A006  Airfoil  with  Flap 
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Included  in  Figures  3  &  4  is  the  unmodified  BIFAE  analysis  using  an  84x31  computational 
grid,  which  has  a  domain  size  of  5  chords.  As  shown  in  Figure  3,  the  5  chord  domain  results  under 
predict  the  flutter  speed  slightly  at  M=0.7  (0.8%),  but  the  prediction  is  off  by  3.8%  at  M=0.84,  which 
is  in  the  flutter  bucket  With  regards  to  the  reduced  frequency,  shown  in  Figure  4,  the  unmodified 
model  over  predicts  the  correct  frequency  by  1.0  %  for  all  Mach  numbers.  The  difference  in  the 
model  predictions  just  discussed  are  entirely  attributed  to  the  reduction  in  domain  size  from  50 
chords  to  5  chords.  This  means  that  the  solution  is  dependent  on  the  domain  size  of  5  chords  for 
the  farfield  boundary. 


Improvement  in  Flutter-Frequency  Prediction  with  Farfield  Locking 


Figure  4:  Reduced  Frequency  Flutter  Boundary  for  a  NACA  64A006  Airfoil  with  Flap 
In  addition.  Figures  3  &  4  show  the  BIFAE  analysis  obtained  with  the  grid  deformation 
modification  implemented.  This  modification  locks  the  movement  of  the  grid  at  the  farfield 
boundary  using  a  weighting  function.  As  shown  in  Figure  3,  the  flutter  speed  prediction  is  improved 
at  all  Mach  numbers.  At  M=0.7  the  solution  is  off  by  0.2%  and  at  M=0.84  the  reduced  velocity 
predicted  is  off  by  2.4%.  Therefore,  the  percentage  improvement  increased  with  increasing  Mach 
number.  For  the  reduced  frequency.  Figure  4,  the  improvement  was  fairly  uniform  with  Mach 
number,  the  predicted  value  was  off  by  0.8%  from  the  grid  independent  analysis.  These  results 
show  the  improvement  in  the  BIFAE  model  obtained  by  incorporating  the  grid  deformation  model. 


34-13 


The  improvements  in  both  the  flutter  speed  and  reduced  frequency  obtained  by  modifying  the  grid 
deformation  model  are  more  significant  than  expected. 


Improvement  in  Flutter-Speed  Prediction  with  New  Boundary  Conditions 


Freestream  Mach  Number 

Figure  5;  Reduced  Velocity  Flutter  Boundary  Effect  of  New  Unsteady  Boundary  Conditions 
To  investigate  the  influence  of  improvements  to  the  farfield  boundary  conditions,  various 
combinations  of  the  original  freestream,  the  steady  characteristic,  and  the  unsteady  approximate 
conditions  are  shown  in  Figures  5  and  6  for  flutter  speed  and  reduced  frequency,  respectively.  All 
the  results  shown  utilize  the  locked  grid  deformation  model  with  the  different  farfield  boundary 
condition.  The  grid  independent  result  from  a  50  chord  domain  analysis  (J=43)  is  given  as  the 
reference  solution  for  this  configuration.  Also,  the  locked  original  boundary  condition  Solution 
shown  in  Figures  3  and  4  is  given  (i.e.  ‘Baseline  Grid(J=31):Freestream-Freestream’).  At  an  inlet 
Mach  number  of  0.7  the  flutter  speed  is  only  slightly  off  with  the  original  boundary  conditions,  using 
both  the  steady  characteristics  of  Janus  and  the  unsteady  approximate  of  Giles  boundary 
conditions  give  a  more  accurate  prediction,  but  the  improvement  is  modest.  As  the  Mach  number 
is  increased  the  improvement  in  the  predictions  of  both  flutter  speed  and  reduced  frequency  from 
the  original  freestream  to  Janus-Giles  conditions  gets  more  significant.  For  the  flutter  speed 
prediction  at  M=.85,  see  Figure  5,  utilizing  the  Giles  unsteady  conditions  alone  results  in  a  0.6% 
improvement  while  using  both  Janus’s  steady  and  Giles  unsteady  results  in  a  1.4%  improvement  in 
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flutter  speed.  While  for  the  reduced  frequency  analysis,  see  Figure  6,  the  improvements  were  0.2% 
and  0.8%,  respectively.  It  is  important  to  note,  at  a  freestream  Mach  number  of  0.84  the 
improvement  from  using  Janus’s  steady  boundary  conditions  is  nearly  zero  in  the  flutter  speed 
prediction  (Figure  5),  but  is  quite  significant  in  terms  of  the  reduced  frequency  prediction  (Figure 
6).  These  results  warrant  a  more  detailed  investigation  at  this  Mach  number. 


Improvement  in  Rutter-Frequency  Prediction  with  New  Boundary  Conditions 


Figure  6:  Reduced  Frequency  Flutter  Boundary  Effect  of  New  Unsteady  Boundary  Conditions 

Since  the  accurate  prediction  of  the  flutter  bucket  is  of  significant  interest  when  doing 
flutter  boundary  studies,  the  significant  change  in  solution  accuracy  from  M=0.84  to  M=0.85  v/as 
investigated  in  more  detail.  The  only  change  between  the  results  at  M=0.85  and  M=0.84  was  in  the 
steady  farfield  boundary  conditions.  At  M=0.85  a  significant  effect  was  seen  when  Janus  conditions 
were  utilized  but  at  M=0.84  no  effect  was  shown.  Therefore,  the  steady  surface  pressure  distribution 
was  investigated  for  differences  as  shown  in  Figure  7.  It  is  quite  apparent  from  Figure  7  that  there  is 
no  effect  on  the  steady  pressure  distribution  no  matter  which  farfield  boundary  conditions  are  used. 
Another  interesting  feature  to  note  shown  in  Figure  7  is  the  lack  of  a  shock  wave.  It  is  well  known 
that  the  freestream  boundary  conditions  have  problems  accurately  modeling  nonlinear  flow 
phenomena.  All  the  results  given  have  been  for  zero  static  pre-twist.  By  increasing  the  airfoil  angle 
of  attack,  a  shock  wave  will  form  on  the  suction  surface  therefore,  giving  a  better  test  of  the 
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improved  analysis  model.  Since  the  predictions  at  a  freestream  Mach  number  of  0.84  did  not  show 
a  significant  improvement  in  the  flutter  speed,  the  angle  of  attack  is  varied  for  this  speed  by 
increasing  the  static  pre-twist  angle  in  0.1°  increments  until  1.0°  is  reached. 


Improvement  in  Equilibrium  Solution  with  Janus  Boundary  Condition 


Figure  8  shows  the  steady  surface  pressure  distribution  for  the  1°  static  pre-twist  analysis  for 
150,  50,  and  5  chord  domain  sizes  for  both  Janus  and  the  original  freestream  farfield  boundary 
conditions.  At  this  angle  of  attack,  the  solution  is  grid  independent  at  the  50  chord  domain  size  as 
shown  by  comparing  the  150  and  50  (i.e.  J=49  and  J=43,  respectively)  chord  results.  In  addition. 
Figure  8  shows  a  normal  shock  on  the  suction  surface  at  75%  of  the  airfoil  chord.  Comparing  the 
predictions  with  Janus  steady  boundary  conditions  to  the  original  freestream  conditions  shows  a 
significant  improvement  in  the  Cp  distribution  at  the  shock  location.  The  original  freestream 
conditions  predicted  the  shock  starting  at  the  60%  chord  location,  while  the  Janus  conditions 
predicted  the  shock  starting  at  70%  chord.  In  addition,  the  Janus  calculations  predicted  the  same 
ending  location  for  the  shock  wave  as  the  50  chord  solution,  while  the  freestream  conditions  where 
8%  chord  upsteam. 
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Improvement  in  Equilibrium  Solution  with  Janus  Boundary  Condition 


Figure  9  shows  the  flutter  speed  variation  with  the  different  static  pre-twist  angles  for  a 
freestream  Mach  number  of  0.84.  Grid  independence  was  achieved  for  the  150  and  50  chord 
domains  solutions  as  shown  in  Figure  9.  For  the  5  chords  domain  (J=31)  results,  all  the  different 
model  configurations  are  shown  for  the  direct  solver.  The  solution  improvement  from  utilizing  the 
locked  grid  deformation  model  is  fairly  consistent  with  changes  in  static  pre-twist.  Therefore,  these 
improvements  are  not  related  to  the  nonlinear  effects  introduced  by  the  strong  shock  wave. 
Comparing  the  approximate  non-reflecting  unsteady  boundary  conditions  with  the  original 
freestream  conditions  resulted  in  another  constant  improvement  in  the  flutter  speed  prediction  for 
all  static  pre-twist  values.  Again,  the  conclusion  is  drawn  that  the  approximate  non-reflecting 
unsteady  boundary  conditions  are  not  being  significantly  affected  by  the  nonlinear  nature  of  the 
flow  physics. 

When  the  steady  characteristic  boundary  conditions  are  introduced  with  Giles  unsteady 
conditions  the  improvement  in  flutter  speed  increases  with  increasing  static  pre-twist.  At  0-  static 
pre-twist,  the  improvement  is  nearly  zero  while  at  1°  the  flutter  speed  prediction  is  2.954  versus 
2.812  without  the  new  steady  boundary  conditions.  The  grid  independent  flutter  speed  is  3.005  at 
1°.  Therefore,  the  solution  error  is  reduced  nenriy  $  jjy  introducing  the  steady  characteristic 
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boundary  conditions,  while  each  of  the  other  modifications  result  in  a  10%  reduction,  respectively. 
This  is  a  strong  indication  that  the  steady  characteristic  boundary  conditions  do  a  significantly 
better  job  of  modeling  the  nonlinear  shock  effects.  Accurate  shock  predictions  are  required  for 
transonic  flight  and  beyond. 


In  addition,  it  is  interesting  to  note  the  slight  scatter  in  the  results  at  0.4°  and  0.7°.  For  these 
conditions  the  analysis  required  reducing  the  CFL  number,  the  under-relaxation ,  parameter  and 
increasing  the  number  of  iterations  to  get  a  converged  flutter  point  solution.  Typically,  the  flutter 
speed  variation  with  static  pre-twist  is  a  smoother  curve.  Several  possible  reasons  for  the '  results 
shown  in  Figure  9  are  inaccuracies  in  the  trailing  edge  model  and  resolution  of  the  shock  location 
on  the  suction  surface.  As  shown  in  Figure  8,  the  shock  wave  was  smeared  over  three  grid  points. 
The  shock  position  is  extremely  important  in  the  calculation  of  the  flutter  speed,  therefore  any  error 
in  its  location  will  have  significant  effects  on  the  aeroelastic  analysis. 

Summary  and  Conclusion 

The  direct  computational  procedure  BIFAE  has  been  successfully  enhanced  by  improved 
flow  physics  models  of  the  farfield  boundary  conditions.  In  particular,  a  deforming  grid  model  was 
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implemented  which  locks  the  farfield  grid  while  allowing  the  grid  near  the  airfoil  surface  to  deform 
with  the  airfoil  movement.  In  addition,  steady  characteristic  and  approximate  non-reflecting 
unsteady  farfield  boundary  conditions  have  been  incorporated  into  the  BIFAE  model.  A  NACA 
64A006  flapped  airfoil,  executing  pitching  and  plunging  motion,  was  utilized  for  verifying  the 
successful  implementation  of  these  higher  fidelity  models.  Both  zero  and  non-zero  angle  of  attack 
results  were  shown  to  highlight  the  improved  efficiency  of  the  enhanced  model.  For  a  1°  static  pre¬ 
twist  analysis  at  a  freestream  Mach  number  of  0.84,  the  new  model  resulted  in  over  a  75%  decrease 
in  the  flutter  speed  error.  Therefore,  the  capability  to  more  accurately  model  transonic  flow 
conditions  with  strong  shock  interaction  has  been  shown. 

Potential  enhancements  to  the  direct  BIFAE  model  as  a  consequence  of  the  research 
described  in  this  report  are  as  follows:  1)  implementation  of  the  approximate  non-reflecting 
unsteady  2-D  farfield  boundary  conditions  of  Giles;  2)  implement  the  Giles  steady  farfield  boundary 
conditions;  and  3)  implement  the  geometric  conservation  law  to  properly  account  for  the  grid 
deformation.  The  results  presented  in  this  report  could  be  improved  by  investigating  the  rounded 
trailing  edge  and  chordwise  grid  resolution.  In  addition,  it  would  be  beneficial  to  identify  the 
precise  computational  domain  size  required  with  the  new  farfield  boundary  conditions  for  a  grid 
independent  solution.  These  improvements  were  not  completed  in  the  current  investigation 
because  of  a  lack  to  time. 
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Abstract 


Numerous  finite-difference,  time-domain  (FDTD)  algorithms  for  dispersive  media  are  presented  and  stud¬ 
ied.  To  limit  the  scope  of  the  discussion,  only  direct  integration  and  recursive  convolution  techniques  are 
considered  in  conjunction  with  the  following  media  types:  the  isotropic  cold  plasma  and  the  multi— pole 
Debye  dielectric.  The  study  primarily  focuses  on  the  numerical  permittivity  calculation  that  results  from 
the  discretization  procedure.  Where  appropriate,  this  permittivity  is  used  to  quantify  the  temporal  error 
mechanisms  of  the  scheme.  Discussions  are  provided  that  assess  the  overall  strengths  and  weaknesses  of  each 
scheme. 
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A  DETAILED  STUDY  OF  THE  NUMERICAL  PROPERTIES 
OF  FDTD  ALGORITHMS  FOR  DISPERSIVE  MEDIA 


Jeffrey  L.  Young 


1  Introduction 

The  finite-difTerence,  time-domain  method  (FDTD)  has  emerged  as  a  leading  numerical  tool  for  the  solu¬ 
tion  of  the  temporally  dependent  Maxwell’s  equations  [1,  2].  Originally  the  scheme  was  created  to  dis¬ 
cretize  Maxwell’s  equations,  with  the  assumption  that  the  medium  that  hosted  the  electromagnetic  wave 
was  isotropic,  non-dispersive,  linear  and  time-invariant.  In  many  applications  that  arise  in  the  study  of 
electromagnetic  waves,  such  assumptions  are  indeed  valid. 

As  the  method  grew  in  popularity  and  proved  its  worth  in  the  prediction  of  scattering,  diffraction  and 
propagation  events,  many  researchers  postulated  and  then  devised  ways  that  the  algorithm  could  be  applied 
to  problems  for  which  the  medium  was  no  longer  simple  in  its  composition.  Such  media  types  include  the 
ionosphere,  biological  tissues,  crystalline  structures,  ferrites,  optical  fibers,  and  radar  absorbing  materials, 
to  name  a  few.  Whatever  the  case  may  be,  however,  these  media  are  subsets  of  what  is  generally  termed  as 
complex  media  -  media  that  is  described  by  one  or  many  of  the  following  descriptors:  dispersive,  anisotropic, 
nonlinear  and  time-variant  [3]. 

This  report  examines  the  most  recent  and  popular  advances  in  FDTD  algorithm  development  as  applied  to 
electromagnetic  wave  propagation  in  complex  media.  Several  methodologies  are  presented  and  comparisons 
are  made  in  order  to  demonstrate  the  strengths  and  weaknesses  of  the  various  approaches.  Particularly, 
direct  integration  and  recursive  convolution  schemes  associated  with  wave  propagation  in  a  cold  plasma  and 
a  Debye  dielectric  are  considered. 

Error  analyses  are  also  provided.  Since  the  discretized  equations  are  approximations  of  Maxwell’s  equa¬ 
tions,  the  data  obtained  from  the  FDTD  algorithm  will  be  corrupted  by  errors  of  the  dissipative  or  dispersive 
kind.  These  errors  linearly  accumulate  after  each  time  step  and  hence,  the  data  will  be  attenuated  and  out 
of  phase  with  the  data  obtained  from  the  exact  solution.  To  quantify  these  errors,  numerical  dispersion 
relationships  are  derived.  These  same  dispersion  relationships  also  provide  information  on  the  stability  of 
the  scheme;  this  information  is  couched  in  terms  of  a  Courant-Friedrichs-Lewy  (CFL)  number  [4].  Thus, 
for  those  schemes  for  which  the  stability  properties  are  known,  that  information  is  also  provided. 

2  McLXwell’s  Equations  and  Complex  Media 

According  to  Maxwell,  the  electromagnetic  field  is  comprised  of  an  electric  field  E(x,  t)  and  a  magnetic  field 
B(x,<)  that  satisfy,  at  a  point,  two  curl  equations  and  two  divergence  equations  [5]: 


(1) 

ary 

VxH  =  -  +  J, 

(2) 
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11 

0 

(3) 

35.3 


and 


V  ♦  B  =  0.  (4) 

Here  D  is  the  electric  flux  density  and  H  is  the  magnetic  intensity.  The  terms  J  and  M  are  often  regarded 
as  source  terms,  and  in  that  context  they  are  the  electric  and  magnetic  current  densities,  respectively.  For 
purposes  of  numerical  computation,  the  two  curl  equations  are  discretized  and  advanced  in  time;  the  two 
divergence  equations  are  additional  constraints  imposed  upon  the  fields.  To  bring  closure  to  this  system  of 
equations,  a  set  of  constitutive  relationships  are  imposed  that  quantify  the  effects  of  a  material  medium  on 
an  applied  electromagnetic  field.  In  general,  we  write 

D  =  D(E,H)  B  =  B(E,H).  (5) 

In  addition,  the  currents  may  also  be  coupled  to  the  fields,  in  which  case 

J  =  J(E,H)  M  =  M(E,H),  (6) 

It  is  in  this  latter  context  that  we  will  invoke  the  notion  of  current.  That  is,  rather  than  the  current  being 
typified  as  impressed,  it  will  be  considered  induced  -  a  current  generated  by  a  field.  For  example,  in  the 
study  of  wave  propagation  in  a  cold  plasma,  J  is  replaced  with  Jp,  the  polarization  current  generated  in  the 
plasma  by  an  applied  field.  By  definition,  the  polarization  current  is  viewed  as  the  time  rate  of  change  of 
the  polarization  vector  P  (i.e.  Jp  =  9P/9t).  The  inclusion  of  P  into  the  field  description  proves  useful  when 
studying  propagation  in  Debye-type  materials. 

If  the  material  is  simple  and  source-free,  the  previous  relationships  reduce  to  D  =  eE,  B  =  /iH,  J  = 
crE  and  M  =  0,  where  the  constants  e,  pi  and  a  are  the  electric  permittivity,  magnetic  permeability  and 
electric  conductivity,  respectively.  Written  as  such,  J  is  now  interpreted  as  the  induced  conduction  current. 
For  complex  materials,  the  constitutive  relationships  do  not  reduce  to  such  simple  forms.  Instead,  the 
relationships  may  be  cast  in  terms  of  convolution  integrals,  matrices  or  derivatives.  For  these  situations,  the 
material  may  be  characterized  as  dispersive,  anisotropic,  or  non-linear,  to  name  a  few  possibilities. 


3  FDTD  Method 


Consider  a  medium  that  is  simple  and  source-free,  but  lossy.  The  FDTD  method  is  based  upon  the  ap¬ 
plication  of  central  differences  for  both  temporal  and  spatial  discretization  [6]  which,  for  the  sake  of  future 
developments,  will  be  considered  separately.  With  respect  to  the  former,  the  temporal  discretization  of 
MaxwelFs  curl  equations  yields 

/H”+i/2  _ 

It  - - -  =  -V  X  E"  (7) 


where  we  have  employed  central  averages  for  the  loss  term  in  Ampere’s  law.  Here  6t  is  the  time  increment 
and  E”  denotes  the  value  of  E  at  time  nSt.  Upon  the  rearrangement  of  terms, 
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Figure  1:  The  Yee  cell 


fE"+i  =  e  ( E"  +  f  - — V  X  (10) 

\l  +  (T6t/2j  \l  +  (T6t/2j  ^  ' 

Whence,  the  update  equation  for  the  field  constituents  is  readily  observed.  Given  that  E  and  H  are  syn¬ 
chronized,  but  syncopated  by  a  half-time  step,  this  type  of  integration  scheme  is  commonly  referred  to  as 
leap-frog  integration.  The  previous  equations  constitute  a  second-order  temporal  method,  as  deduced  from 
the  truncation  term  of  the  central  difference  and  average  approximations. 

To  accomplish  the  spatial  discretization,  the  Yee  grid  is  invoked,  as  shown  in  Figure  1.  The  components  of 
the  Yee  grid  are  arranged  in  accordance  with  the  natural  rotational  structure  of  the  curl  operator  acting  upon 
H,  which  is  also  spatially  proportional  to  E  (or  vice-versa).  For  example,  upon  the  spatial  discretization  of 
the  z-component  of  Ampere’s  law,  we  obtain. 


•  V  X  H 


i+l/2J  +  l/2,k 


Hy 

-Hy 

Hr 

-Hr 

.+1  ^ 

i 

J  +  1 

j 

i+i/2,fc  L 


i+l/2,k 


(11) 


The  indices  (i,i,  ib)  denote  the  spatial  coordinates  (iSxyj^yykSz)  with  6^^  Sy  and  Sz  being  the  length  of  the 
cell  in  the  x^y^z  directions,  respectively.  The  remaining  five  components  of  electromagnetic  field  are  derived 
in  a  similar  manner. 

To  quantify  the  FDTD  method  in  terms  of  its  dissipation  and  dispersion  errors,  consider  a  plane  wave 
propagating  in  the  k  direction  and  in  an  homogeneous  open  domain.  That  is,  let  E  =  Eo  e  and 

H  =  E©  and  Ho  are  constant  vectors.  If  E  and  H  are  to  be  solutions  of  the  discretized  form 

of  Maxwell’s  equations,  then 

-  jK  X  Eo  = (12) 
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and 


(13) 


-  jK  X  Ho  =  jfifEo  +  AtrEo ; 
moreover,  K  •  Eo  =  0  and  K  •  Ho  =  0.  Here 


Obviously,  K  k,  Q  u  and  A  ^  1  as  ^ j, ,  (Jj, ,  6^ ,  — *•  0,  as  required  for  a  consistent  scheme.  Further 

manipulation  of  Eqns.  (12)  and  (13)  leads  to  the  equation 

K  •  K  =  —jQfM{jQe  +  Act)  =  (17) 

which  is  the  numerical  dispersion  equation  and  is  the  counterpart  of  the  analytical  dispersion  equation 
=  —  Jw/i(jwc  +  CT);  here  e  =  [e  +  ACT/(jQ)].  In  many  of  the  subsequent  sections  this  same  equation  will  be 
referenced,  but  different  permittivity  relationships  will  be  supplied.  Whatever  the  case  may  be,  Eqn.  (17) 
provides  the  necessary  information  for  understanding  the  scheme’s  dispersion,  dissipation,  anisotropicity  and 
stability  properties  [7,  8]. 

Stability  (i.e.,  non-exponential  growth  in  time)  is  insured  if  Im{a;(5t}  >  0.  From  Eqn.  (17)  and  for  ct  =  0, 
this  constraint  is  satisfied  for  all  directions  of  propagation  provided  that  [2] 


Here  Vp  —  1/ ^y/I€,  which  is  the  phase  velocity  (if  e  =  Cq  and  then  Vp  =  c,  the  speed  of  light).  Note: 

If  Eqn.  (18)  is  satisfied,  Im{w5<}  =  0,  which  implies  a  dissipationless  numerical  scheme.  Finally,  given  that 
the  FDTD  method  is  both  consistent  and  stable.  Lax’s  theorem  states  that  the  data  will  converge  to  the 
exact  solution  as  the  cell  and  time— step  become  infinitesimally  small  [9]. 

In  the  ensuing  sections,  the  FDTD  method  will  be  modified  or  extended  to  model  wave  propagation  in 
complex  media.  However,  in  all  cases  considered,  the  Yee  grid  of  Figure  1  and  the  discretization  of  the  curl 
operators  will  be  retained. 


4  Cold  Plasma 


To  demonstrate  the  wide  variety  of  techniques  available  to  model  numerically  wave  propagation  in  linearly 
dispersive  media,  the  case  of  wave  propagation  in  an  isotropic,  lossy,  cold  plasma  is  considered  next.  For 
this  scenario.  Ampere’s  law  and  the  momentum  equation  are  [10] 


5E 

(19) 

2t, 

—  =  -I/Jp  +  foW^E, 

(20) 
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where  3p  is  the  polarization  current,  u  is  the  collision  frequency  and  Up  is  the  plasma  frequency;  specifically 
Jp  =  — no?u  and  LJp  =  y/noq^/{m€o).  For  time-harmonic  signals  of  dependence  it  is  a  simple  exercise 
to  show  that  the  momentum  equation  can  be  embedded  into  Ampere’s  law,  from  which  a  scalar  frequency 
domain  permittivity  i  can  be  defined.  Suppressing  that  analysis,  one  finds  that 

where  x  is  the  scalar  frequency  domain  susceptibility.  Since  D  =  eE,  the  corresponding  time-domain 
constitutive  relationship,  cast  in  terms  of  a  convolution  integral,  is  deducible  from  Fourier  theory  [11]: 

D(x,<)  =  eoE(x,i)  +  Co  /  x(T)E(x,f  -  r)  dr,  (22) 

J  —  OO 

where 

X(r)  =  ^  [1  -  Uir).  (23) 

Here  U  is  the  unit  step  function. 

Each  equation  in  the  previous  development  gives  a  different  insight  into  the  phenomenology  of  wave  prop¬ 
agation  in  a  cold  plasma.  And,  for  each  of  those  insights,  there  exists  a  corresponding  FDTD  discretization 
methodology.  In  this  section,  we  will  explore  several  such  methodologies  and  will  classify  those  methodologies 
into  two  categories:  Direct  integration  methods  and  recursive  convolution  methods.  The  direct  integration 
methods  are  either  based  on  the  fundamental  state  equations  of  Eqns.  (19)  and  (20)  or  on  the  constitutive 
relation  D  =  eE.  The  recursive  methods  incorporate  the  convolution  integral  of  Eqn.  (22)  into  the  FDTD 
paradigm^.  Although  each  discretization  procedure  is  consistent  with  the  governing  equations,  their  accu¬ 
racy  order  and  memory  requirements  are  different.  Such  similarities  and  differences  are  documented  in  the 
last  subsection. 

In  the  following  subsections,  Eqn.  (9)  is  the  temporally  discretized  form  of  Faraday’s  law.  Additionally, 
the  components  of  Jp,  D  and  E  are  assumed  to  occupy  the  same  locations  in  the  Yee  grid. 


4.1  Direct  Integration  Method  One:  CP-DIMl 


Consider  first  a  state-space  integration  technique.  For  this  situation,  Eqns.  (19)  and  (20)  are  discretized 
using  central  differences  for  the  derivatives  and  central  averages  for  the  collision  term.  To  maintain  syn¬ 
chronization  with  the  FDTD  leap-frog  integrator,  the  momentum  equation  and  Faraday’s  law  are  advanced 
simultaneously  and  collectively  leaped  with  Ampere’s  law.  For  source-free  media,  this  procedure  results  in 
the  following  system  of  equations  [15]: 


,n+l/2  .jn-l/2 

Jp  Jp 


St 


+  eoW^E" 


(24) 


and 

)  =  V  X  (25) 

Given  the  simplicity  of  the  algebraic  manipulations  that  solve  for  in  Eqn.  (24),  the  final  expression 

is  not  given.  This  will  be  the  case  for  many  of  the  other  direct  integration  methods  as  well. 

^  Since  the  Z-- transform  method  is  a  subset  of  the  recursive  method,  it  will  not  be  documented  herein.  For  more  information 
on  using  Z-transforms  in  conjunction  with  the  FDTD  method  consult  [12]-[14]. 


£n  +  l  _  £n 
St 
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4.2  Direct  Integration  Method  Two:  CP-~DIM2 


Instead  of  averaging  the  loss  term,  as  in  CP-DIM  1,  averaging  the  current  term  in  Ampere’s  law  is  equally 
possible.  For  this  case,  the  momentum  equation  and  Ampere’s  law  are  synchronized  and  collectively  leaped 
with  Faraday’s  law  [16].  That  is, 

^  =  -i/J^  +  CoWpE"  (26) 


and 


26i 


6t 


(27) 


4.3  Direct  Integration  Method  Three:  CP-DIM3 

Next  consider  the  case  in  which  the  polarization  current  in  Ampere’s  law,  the  electric  field  term  in  the 
momentum  equation  and  the  dissipation  term  in  the  momentum  equation  are  all  time  averaged.  The  syn¬ 
chronization  of  the  momentum  equation  and  Ampere’s  law  is  maintained  [17].  The  resulting  scheme  is 


and 


'E"+i  +  E"'\ 

2  ) 

(28) 

(29) 

4.4  Direct  Integration  Method  Four:  CP-DIM4 

Observing  that  the  momentum  equation  has  solutions  that  decay  exponentially  like  Cummer  suggests 
that  exponential  fitting  is  necessary,  particularly  when  the  dissipation  term  is  large  [17].  To  that  end,  the 
exponentially  fitted  equation  is 


jn+i  ^  ^  ^  ^  J,„+1  +  ^  (1  _  e-"*'  -  E". 

Ampere’s  law  still  requires  the  averaging  of  the  polarization  current: 


(30) 


(31) 


4.5  Direct  Integration  Method  Five:  CP— DIM5 

The  next  direct  integration  technique  considers  the  frequency  domain  constitutive  relationship  D  =  eE  and 
Eqn.  (21)  as  the  basis  of  temporal  discretization  [18].  Combining  these  two  equations,  we  obtain 


(w^  —  ywi/)D  =  eo(w^  —  jwj/  —  Wp)E. 


(32) 


Since  multiplications  of  ju  in  the  frequency  domain  are  equivalent  to  time  derivatives  in  the  time-domain, 
Eqn.  (32)  is  equivalent  to 


dB  d^E  dE  , 


(33) 
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Discretization  of  Eqn.  (33)  with  second-order  central  differences,  along  with  Eqn.  (2),  yields  the  following 
system  of  semi-discrete  numerical  equations: 

D"+i  =  D”  +  6tV  X  (34) 


eo(2  +  i/<5<)E"+i  =  (2  +  j/(50D"+i + 

-  2eo(wp^?-2)E"-eo(2-Jv6<)E"-^  (35) 

4.6  Recursive  Convolution  Method  One:  CP-RCMl 

The  recursive  convolution  method  is  based  upon  the  discretization  of  the  convolution  integral  of  Eqn.  (22) 
[19].  At  time  nSt, 

rndi 

D(x,  n6t)  =  eoE(x,  nSt)  +  e©  /  E(x,  nSt  -  'r)x(r)  dr.  (36) 

Jo 

Under  the  assumption  that  E  is  piece-wise  constant, 

n-l  r(^+l)6i 


D"  =  eoE"  +  Co  XI  E"-'"  /  X(r)  dr. 

_ n  Jm6i 


Likewise, 


D"+i  =  eoE"+i  +  fo  X  E"-"*+i  /  x(7-)  dr. 

m=zO 


The  discretization  of  Ampere’s  law,  as  given  by  Eqn.  (2),  requires  us  to  take  the  difference  of  the  two 
preceding  equations.  Doing  so  and  replacing  that  difference  into  Ampere’s  law,  we  obtain 

E"+^  =  aE"  +  +  —  V  X  (39) 


where 


X"'  =  /  X{r)  dr. 

Jmdi 


Performing  the  calculation  for  and  in  conjunction  with  Eqn.  (23),  we  obtain 


Ax'"  =  -  (l-e""*')^  =  Ax°e-'"‘'^‘. 
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Although  Maxwell’s  equations  are  temporally  discretized,  they  are  not  in  a  form  suitable  for  computation. 
For  we  note  that  to  compute  Eqn.  (40),  one  would  have  to  store  the  complete  time  history  of  E.  However, 
given  the  exponential  form  of  Ax"*, 


n  — 1 

m=0 


whence. 


=  Ax°E”  + 


Thus,  the  equation  for  ^  reduces  to  a  simple  recursive  relationship. 


(46) 


(47) 


4.7  Recursive  Convolution  Method  Two:  CP-RCM2 


Numerical  accuracy  of  CP-RCMl  can  be  increased  by  assuming  piecewise  linear  line  segments  in  the  integra¬ 
tion  of  the  convolution  integral  [20].  For  this  situation,  one  approximates  E  over  the  interval  [n6t,{n  +  l)(5t] 
with 


E(x,t)  =  E(x,  n6i)  +  {t-  nSt)  +  E(x,TigO^ 

Substitution  of  this  approximation  into  Eqn.  (36)  yields  the  following  result: 


n— 1 


n-1 


D"  =  CoE"  -f  fo  ^  E"-'”x’”  +  Co  ^  (E”-'"-i  -  E"-'")  r, 


m-0 


m=0 


where  x"*  is  given  by  Eqn.  (43)  and 


1  + 

=  7-  /  (t-  -  mSt)x{r)  dr. 

Jm6, 


(48) 


(49) 


(50) 


As  before,  the  time  derivative  of  D  at  time  {n+l/2)6t  is  approximated  to  second-order  by  (D"+^  -  'D’^)/St. 
From  Eqn.  (2)  and  Eqn.  (49), 


E"+i  =  a(l  -  ^°)E"  -h  —V  X  ^ 


Here 


n-l 


E"“'"Ax'"  +  (E"“'"“^  -  E"“”’)A^'’ 
1 


m=0 


a  = 


(l  +  X°-^°) 


and 


(51) 

(52) 

(53) 

(54) 


The  expression  for  Ax'"  is  given  by  Eqn.  (45). 

For  the  time-domain  susceptibility  function  given  by  Eqn.  (23),  it  is  a  simple  matter  to  show  that 


_  _ £_ 


[f+— (' 


g-vi,  I 


(55) 
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Method 

Numerical  Permittivity 

:  i 

CP-DIM  1 

ai^ii 

bhi^i 

CP-DIM2 

e  =  Co 

fi  1  1 

CP-DIM3 

e  =  Co 

CP-DIM4 

^  1  8in[(w— j*/)5</2]— nsinh(»/^</2)} 

8in[( w  —  ) 5 <  / 2] 

CP-DIM5 

e  =  Co 

CP-RCMl 

e--eo^l-h  \  2mt8int(w-ji/)«</2] j 

CP-RCM2 

rai 

'  ^  2^2jt  8in[((j 

Table  1:  Numerical  permittivities  for  various  schemes  associated  with  propagation  in  an  isotropic  cold 
plasma. 

and  ^ 

^  ~  i)  ~  .  (56) 

Finally,  the  summation  over  all  time  in  Eqn.  (52)  is  replaced  with  a  recursive  factor  -  to  wit, 

=  Ax°E"  +  (E"“i  »  E”)A^°  +  (57) 

The  expression  for  Ax^  is  given  by  Eqn.  (45)  when  m  =  0. 

Although  the  previous  equation  appears  to  require  an  additional  storage  location  for  E”“\  such  is  not 
the  case.  As  pointed  out  by  Kelley  ei  al  [20],  a  low  storage  implementation  is  possible  by  making  judicious 
use  of  a  temporary  variable. 

4.8  Comparative  Analysis 

The  numerical  errors  of  each  of  the  aforementioned  schemes  are  deduced  in  a  similar  way  as  in  Section  Three. 
Assuming  again  plane  wave  propagation  and  an  unbounded  domain,  the  numerical  dispersion  relationship 
is  similar  to  that  given  by  Eqn.  (17).  Of  course,  each  scheme  produces  a  different  e,  as  seen  in  Table 
1.  Note  the  equivalence  between  CP-DIM  1  and  CP-DIM5.  Even  though  both  of  these  procedures  are 
derived  from  different  equations,  the  final  numerical  permittivities  are  identical.  However,  as  seen  in  Table 
3,  CP-DIM5  requires  more  memory.  Next,  as  pointed  out  by  Cummer  [17],  there  exists  an  equivalence 
between  exponential  differencing  (i.e.,  CP-DIM4)  and  the  recursive  convolution  methods;  this  observation  is 
confirmed  in  Table  1 ,  where  it  is  seen  that  the  numerical  permittivities  bear  some  semblance  to  one  another. 
Finally,  if  =  A^°  =  0,  CP-RCMl  and  CP-RCM2  are  one  and  the  same,  as  expected. 

To  appreciate  the  subtle  differences  between  each  of  the  schemes,  Table  2  is  provided.  In  this  table,  each 
entry  corresponds  to  the  leading  term  of  the  Taylor  series  of  the  function  As  seen  from  Eqn.  (17),  the 

function  contains  the  complete  temporal  information  of  the  numerical  scheme.  Except  for  CP-RCMl, 
these  numerical  permittivity  relationships  reduce  to  the  exact  analytical  permittivity  relationship  to  the 
order  of  Sf;  CP-RCMl  is  temporally  accurate  to  first-order.  Of  course,  for  certain  choices  of  w,  u)p,  v  6t, 
the  scheme  with  lowest  truncation  term  is  deemed  temporally  best,  from  an  accuracy  point  of  view. 
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Method 

Truncation  Term 

Truncation  Term:  u  =  0 

CP-DIMl 

12 

CP-DIM2 

12 

CP-DIM3 

12 

CP-DIM4 

12 

CP-DIM5 

. 

—W  Of 

12 

CP-RCMl 

p  „„ 

uf'iwSt 

2j 

CP-RCM2 

12 

Table  2:  Truncation  term  from  the  Taylor  analysis  of  fi^e/eo:  Cold  plasma. 


A  special  case  of  interest  is  when  u  =  0.  From  Table  2,  it  is  readily  seen  that  CP-DIMl,  CP-DIM2  and 
CP--DIM5  all  have  the  same  leading  truncation  term,  which  is  independent  Up.  CP-DIM3  and  CP-DIM4 
also  have  the  same  truncation  term;  the  truncation  term  for  CP-RCM2  is  close  in  value  to  that  of  CP- 
DIM3  and  CP-DIM4.  Since  the  truncation  term  for  CP-DIM3,  CP-DIM4  and  CP-RCM2  are  dependent 
on  the  truncation  terms  are  smaller  than  those  associated  with  CP-DIMl,  CP-DIM2  and  CP-DIM5. 
Moreover,  there  exists  a  frequency  for  which  the  truncation  terms  of  CP-DIM3,  CP-DIM4  and  CP-RCM2 
vanish.  Finally,  if  <<  1,  then  all  of  the  second  order  schemes  produce  a  truncation  term  of  /12f 

which  is  the  truncation  term  of  the  standard  FDTD  algorithm. 

Other  properties  of  these  algorithms,  including  memory  and  one-dimensional  CFL  equations  are  shown 
in  Table  3  [17].  As  expected  from  the  previous  discussion  on  the  truncation  terms,  CP-DIMl,  CP-DIM2 
and  CP-DIM5  have  the  same  stability  equations,  which  are  dependent  on  the  plasma  frequency,  when  2/  =  0, 
When  2/^0,  CP-DIM2  is  unconditionally  unstable.  CP-DIM3  and  CP-DIM4  have  a  CFL  of  unity.  This 
is  to  be  expected  since  the  leading  truncation  term  of  for  CP-DIM3  and  CP-DIM4  contains  the 

factor  {ujp6t/2y,  which  is  also  the  same  factor  that  appears  in  the  CFL  equation  for  CP-DIMl,  CP-DIM2 
and  CP-DIM5.  As  for  CP-RCMl  and  CP-RCM2,  the  CFL  is  found  experimentally  to  be  near  unity  [17]. 
From  a  memory  point  of  view,  CP-DIM2  and  CP-DIM5  require  additional  memory  (compared  to  the  other 
schemes)  for  the  back  storage  of  a  vector. 

5  Debye  Dielectric 

The  Debye  dielectric  has  received  considerable  attention  in  the  literature  due  to  the  fast  growing  interest 
in  electromagnetic  wave  interactions  with  biological  and  water-based  substances  [2li-[23].  Given  the  water 
content  of  these  substances,  they  are  well  modeled  by  a  combination  of  single-pole  models,  which  account  for 
permanent  dipole  moments  established  within  the  water  [3].  Moreover,  since  the  Debye  model  is  consistent 
with  the  requirements  associated  with  the  Kramers-Kronig  relationships,  the  model  inherently  satisfies 
causality  requirements.  For  this  reason,  an  M-pole  model  is  often  used  to  construct  an  analytical  model 
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Method 

3D  Memory/Cell 

ID  Stability 

CP-DIM  1 

9 

it  <  iSx/c)\/l  -  (Wp^t/2)2;  i/  =  0 

CP-DIM2 

12 

it  <  (ix/c)^!  -  {ijp6t/2y-,  1^  =  0 

CP-DIM3 

9 

Sf  <  6x/c 

CP-DIM4 

9 

Si  ^  Sx  /  C 

CP-DIM5 

12 

f  yif  fffTTri  B  ffi  1  Ul 

CP-RCMl 

9 

6^“^  w  i^/c 

CP-RCM2 

9 

6"'“=^  «  i^/c 

Table  3:  Comparison  table  of  various  schemes  in  terms  of  memory  per  cell  and  stability  equations:  Cold 
plasma. 


from  empirical  data.  Since  in  the  frequency-domain  the  permittivity  of  the  Debye  dielectric  is  given  by 

M  _ 

?  =  +  eo  £ 


(58) 


the  modeling  task  is  to  deduce  values  for  r*,  Coo  and  that  best  fit  i  to  the  data  [24].  Here  is  the  kth 
relaxation  time,  €©600  is  the  value  of  e  at  infinite  frequency  and  is  the  zero  frequency  permittivity  of 
the  kth  relaxation.  Note:  In  the  time-domain, 


x{t) 


M  ,  ^ 

~  ^00  \ 


“t/rjt 


U{t). 


(59) 


The  Debye  dielectric  may  also  be  characterized  in  state  variable  form  (excluding  Faraday’s  law)  in  terms 
of  the  polarization  vector  P  [25]: 


^  1 

eofoo-^  =  V  X  H  -  ^  -[(e,fc  -  e„)eoE  -  P^] 


k=l 


and 


^  =  -[(€,* -€oc)eoE- Pi], 
ot  Tk 


(60) 


(61) 


where  A:  =  1, 2,  •  •  •,  M  in  the  last  equation  (Note:  The  subscript  k  denotes  the  pole  number,  not  the  spatial 
index  of  z.).  These  latter  two  equations  form  the  basis  for  two  of  the  direct  integration  schemes. 

5.1  Direct  Integration  Method  One:  D-DIMl 

For  the  first  direct  integration  technique,  consider  initially  a  single-pole  model.  Then 


and 


1 

Cofoo^  =  V  X  H - [(e,  -  eoo)eoE  -  P] 

at  T 


^=i[(e,-eoo)eoE-P]. 


(62) 

(63) 
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The  components  of  P  and  E  are  conjoined  on  the  Yee  grid  and  the  discretization  of  the  curl  operators  is 
accomplished  in  the  usual  manner.  Regarding  the  temporal  discretization,  we  invoke  the  notion  of  the  central 
difference  and  average  approximations  and  the  leap-frog  time  advancement  [26].  To  this  end, 


pn+l/2  _  pn-1/2  if  / 

~  ~  foo)eoE"  -  ^ 


pr»’fl/2  _|_  pn-‘l/2> 
2  . 


ana 

--  =  V  .  -  1  [«.  -  (5!!^)  -  P-./=]  .  (65) 

The  proceeding  equations  can  be  algebraically  arranged  so  that  only  and  appear  on  the  left- 

hand  side  of  the  equations;  see  [26].  Note  that  in  this  implementation,  the  polarization  vector  P  and  the 
magnetic  field  vector  H  are  advanced  simultaneously  and  leaped  with  E. 

The  basic  strategy  for  M-poles  is  essentially  the  same  as  before,  with  the  exception  that  all  of  the 
polarization  equations  are  advanced  simultaneously  with  Faraday’s  law  [26].  When  temporal  averaging  is 
invoked,  the  discretized  temporal  form  becomes 


pn  +  l/2  _  pn-l/2  ^  ' 

— - 7—^ -  =-  -  eoo)€oE" 

Ot  Tk 


pn+l/2  p^-1/2 


(^) = V  ^  -  t  ^  [(..  -  (^)  -  pr-] . 

Further  manipulation  of  Eqns.  (66)  and  (67)  leads  to  the  following  update  equations  for  P  and  E; 


where 


+  l3kSiT-\e,k  -  eoo)eoE", 


2rfc  -  St 
StJ;  +  Si  ' 


+  St  ’ 

M 

E"+1  =  ciE"  +  c^SiV  X  +  c^Si 


where 


Cl  =  e, 


2  r. 


■  r  jc  ^ 

J  L  k  =  l 


^sk  Cq 


C2  -  gooCo  +  —/  . 


^  ~  goo)go 

2  ^  n 
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5.2  Direct  Integration  Method  Two:  D-DIM2 


To  have  greater  control  over  the  stability  properties  of  the  scheme,  semi-implicit  schemes,  such  as  those 
proposed  by  Kashi wa  [25]  and  Petropolous  [27]  must  be  considered.  (The  reader  may  also  consult  [28]  for 
further  information  on  the  semi-implicit  approach.)  For  example,  for  Mth-order  media, 


/E"+1-E"\  A/P*"^  -Pfc^  „  „„ 


(^) = ^  [<-  -  (^)  -  (t)]  ‘  <-> 

here  fc  =  1, 2,  •  •  •, M. 

5.3  Direct  Integration  Method  Three:  D-DIM3 

Before  leaving  this  section  on  direct  integration  techniques,  we  wish  to  consider  an  integration  technique 
based  upon  the  frequency  domain  permittivity  relationship:  D  =  eE  [18].  As  an  example,  consider  the 
technique  for  a  one-pole  Debye  dielectric.  For  this  situation,  the  constitutive  relationship  can  be  written  as 

D  =  eo  L  +  E-  (76) 

[  l+jwrJ 


Or,  in  the  time-domain, 


5D  dE 

D  +  T—  =  e,eoE  +  ^oo^oT-^. 


As  before,  central  differences  and  averages  are  applied  to  the  previous  ordinary  differential  equation,  whereby 
an  update  equation  for  E  is  obtained.  Using  this  update  equation  and  the  corresponding  update  equation 
for  Ampere’s  law,  we  obtain 

D"+i  =d”  +  5,V  X  (78) 


=  f  +  N  pn+i  ,  (  E"  (^79') 

Vco(2reoo  +  Veo(2reoo  +  \2reoo  +  e»^«y 

Note:  The  update  equation  for  must  be  rederived  for  each  additional  pole  considered  in  the  dielectric 

model. 


5.4  Recursive  Convolution  Method  One:  D-RCMl 

The  formulation  of  the  recursive  convolution  method  follows  one-for-one  with  that  of  CP-RCMl.  The 
only  difference  in  that  formulation  and  the  one  presented  below  is  the  equation  for  the  accumulator,  since 
the  time-domain  susceptibility  function  is  different.  To  account  for  Cqoj  the  first-order  update  equation  is 
rewritten  as  [2 9] 

E"+^  =  CoottE"  +  +  —V  X  (80) 

^0 


where 


eoo+X° 
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here 


X°  =  (f.-€oo)(l-e-'‘/^). 

=  Ax^E"  + 

Ax°  =  X°  (l  -  . 

5.5  Recursive  Convolution  Method  Two:  D-RCM2 


With  respect  to  the  accumulator, 
where 


(82) 

(83) 

(84) 


To  improve  the  temporal  accuracy,  a  piecewise  linear  model  may  be  employed  in  the  integrand  of  the 
convolution  integral  [20].  (See  also  CP-RCM2  for  the  cold  plasma.)  The  final  equations  are 


E"+i  =  a(eoo  -  ^“)E”  +  —V  x  ^ 


where 


and 


a  — 


(eoo+X°-n 


=  Ax°E"  +  (E”"^  -  E")A^°  +  6“^'/’'’^"“^ 

Here  and  Ax°  are  given  by  Eqns.  (82)  and  (84),  respectively.  As  for  and  A^°, 


and 


(85) 

(86) 

(87) 

(88) 

(89) 


5.6  Comparative  Analysis 

As  with  the  cold  plasma,  Table  4  documents  the  numerical  permittivities  for  the  previously  reported  schemes. 
From  this  table,  the  equivalence  between  D-DIM2  and  D-DIM3  is  seen.  However,  since  D-DIM3  is  con¬ 
structed  from  the  frequency-domain  constitutive  equation,  D-DIM3  requires  more  memory  than  D-DIM2, 
Finally,  D-RCMl  is  first-order  accurate  in  time;  all  other  schemes  are  temporally  second-order  accurate. 

To  appreciate  many  of  the  subtle  features  of  each  of  the  aforementioned  schemes.  Tables  5  and  6  are 
given.  These  tables  show  the  leading  truncation  term  of  the  function  Q^i/co.  From  a  temporal  accuracy 
point  of  view,  the  scheme  with  the  lowest  truncation  term  for  a  given  a;,  r,  Cg,  Coo  and  6t  is  deemed  best. 
Consider  Table  6,  which  shows  the  leading  truncation  term  for  the  two  cases  cjr  <<  1  and  ujt  »  1.  Except 
for  D-RCMl,  all  other  schemes  have  identical  high-frequency  truncation  terms.  These  terms  are  the  same 
truncation  terms  that  the  standard  FDTD  algorithm  would  produce  in  a  dielectric  of  permittivity  Coo^o* 
the  low-frequency  limit  only  D-DIM2  and  D-DIM3  predict  the  anticipated  truncation  term.  Thus  it  can  be 
argued  that  D-DIM2  and  D-DIM3  are  the  most  compatible  with  the  standard  FDTD  algorithm. 

A  summary  of  the  stability  properties  and  memory  requirements  of  the  various  methodologies  is  provided 
in  Table  7.  Here  we  note  that  D-DIM2  is  the  most  memory  intensive.  With  respect  to  the  stability  equations, 
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Method 

Numerical  Permittivity:  e 

D-DIMl 

D-DIM2 

e  =  Ccxjeo  +  (e,  -eoo)eo 

A 

A+j  ClT 

D-DIM3 

e  =  Coofo  +  (e»  -  eoo)eo 

A 

A+jOt 

D-RCMl 

t  —  to  Coo  1  2jQ6t8\nh[{6t/r){l+jwr)/2] 

D-RCM2 

f  r  r  1  \  6tl(2T^ 

0  oo  <^  \  \2jQ6i8\nh[{6i/T){l‘\‘jwT)/2] ) 

Table  4:  Numerical  permittivities  for  various  schemes  associated  with  propagation  in  single-pole  Debye 
dielectric. 


Method 

Truncation  Term 

D-DIMl 

12 

D-DIM2 

12 

f  (j— 

D-DIM3 

12 

(foo  f«)  [j-UT  +  O-Ut)^) 

D-RCMl 

2  1+jwT 

D-RCM2 

12 

j(eoo“2c,)+6oot*^T 

J— WT 

Table  5:  Truncation  term  from  the  Taylor  analysis  of  Debye  dielectric. 


Method 

Truncation  Term:  ur  «  1 

Truncation  Term:  wr  »  1 

D-DIMl 

(^)  [(^'>  “  ^oo)  (2  +  If )  -  foo] 

^u;*6tCoo 

12 

D-DIM2 

—  £g 

12 

^W*6'Uoo 

D-DIM3 

—u)^Sfe, 

12 

-W*6^€oo 

12 

D-RCMl 

{^)ie,-eoo) 

~  ^oo) 

D-RCM2 

{^)Voo-2e,) 

-a)^5?eoo 

12 

Table  6:  Truncation  term  from  the  Taylor  analysis  of  for  two  special  cases:  Debye  dielectric. 


Method 

3D  Memory/Cell 

ID  Stability 

9 

ITfffrittl 

D-DIM2 

9 

^  yj^oo^x! C 

D-DIM3 

12 

^  y/^oo  ^ 

D-RCMl 

9 

irffTmii 

D-RCM2 

9 

g'yynTmi 

Table  7:  Comparison  table  of  various  schemes  in  terms  of  memory  per  cell  and  stability  equations:  Single¬ 
pole  Debye  dielectric. 
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the  entries  for  D-DIMl,  D-RCMl  and  D-RCM2  have  not  been  rigorously  established.  The  entry  for  D- 
DIM3  follows  directly  from  the  semi-implicit  nature  of  the  scheme  [30].  Finally,  since  D-DIM2  and  D-DIM3 
are  equivalent  to  one  another,  they  also  have  the  same  stability  equation. 

To  close  this  section,  a  comment  on  multi-pole  Debye  dielectrics  is  in  order.  Each  of  the  schemes 
reviewed  herein  are  capable  of  modeling  the  multi-pole  dielectric.  Some  schemes  like  D-DIMl,  D-RCMl 
and  D-RCM2  can  be  directly  extended  without  any  special  treatment.  Other  schemes  like  D-DIM3  require 
a  new  derivation  for  each  new  pole  added  to  the  model.  As  for  D-DIM2,  the  semi-implicit  nature  requires 
the  loading  and  the  inverting  of  a  new  matrix  (each  time  a  pole  is  added)  to  find  the  necessary  coefficients 
of  the  algorithm. 

5.7  Parameter  Selection 

To  set  the  parameters  6t  and  for  the  simulation,  consider  the  example  in  which  muscle  is  the  medium. 
For  a  two-pole  model,  =  40,  =  3948,  e,2  =  59.09,  n  =  46.25  ns  and  -  .0907  ns  [23].  One 

possible  way  to  deduce  a  value  for  6i  is  to  make  it  a  fraction  of  the  smallest  relaxation;  for  this  example, 
let  6t  =  T2/IO  =  9.07  ps.  To  minimize  dispersion  errors,  the  CFL  number  should  be  maximized.  Thus, 
from  the  definition  of  CFL  let  =  .430  mm,  where,  for  this  example,  =  4.74  x  10^  m/s.  A 

second  way  is  to  set  6^  in  accordance  with  the  minimum  wavelength  of  interest.  Let  Sj;  =  A"**”/20,  where 
yntn  'g  minimum  wavelength  at  some  angular  frequency;  for  this  example  let  uj  =  10/r2,  in  which  case 
yntn  _  2.69  mm  and  Sx  =  .135  mm.  Finally,  from  the  stability  relationship,  the  maximum  time  step  is 
St  =  6x/v^^^  =  2.84  ps. 

Based  upon  the  previous  example,  the  more  conservative  choice  for  the  values  of  Sx  and  St  is  based  upon 
the  minimum  wavelength  argument.  Unfortunately,  this  argument  produces  a  time-step  that  over-resolves 
the  time-domain  driving  signal.  Moreover,  by  doing  so,  we  incur  the  cost  of  increasing  the  simulation  time 
by  about  a  factor  of  3.2  and  the  computational  memory  by  a  factor  of  32.8  (three-dimensional  domain), 
as  compared  with  the  other  method.  Had  we  chosen  the  first  method,  the  high-frequency  information  of 
the  medium  would  not  be  capture  and  hence,  the  early-time  information  of  the  output  data  would  be  in 
error.  One  possible  way  to  resolve  these  contrary  specifications  is  to  increase  the  spatial  accuracy  order  of 
the  scheme  [27].  For  example,  if  a  fourth— order  in  space  and  a  second— order  in  time  scheme  is  employed, 
the  sampling  requirements  on  the  minimum  wavelength  can  be  relaxed  and  the  resulting  time-step  will  be 
commensurate  with  the  smallest  relaxation. 

6  Summary 

/ 

A  survey  of  some  of  the  more  popular  schemes  associated  with  FDTD  analysis  of  electromagnetic  wave 
propagation  in  complex  media  has  been  given.  These  schemes  have  been  characterized  in  terms  of  their 
numerical  permittivities  in  conjunction  with  a  numerical  dispersion  relationship.  In  addition,  the  CFL 
number  of  each  scheme  is  given  or  postulated  when  such  information  has  been  provided  in  the  literature.  In 
some  cases,  discussions  on  memory  requirements  have  also  been  provided. 

It  is  not  the  intention  of  this  report  to  rank  such  schemes  in  terms  of  best  or  worst.  Such  terms  are 
subjective,  since  best  or  worst  designators  require  one  to  quantify  both  the  tangible  (e.g.,  accuracy)  and 
intangible  (e.g.,  simplicity)  features  of  a  scheme  into  a  single  number.  Instead,  each  scheme  should  be 
analyzed  with  a  particular  application  in  mind  and,  from  that  analysis,  an  informed  selection  can  be  made. 
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